Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Submit feedback
Sign in
Toggle navigation
N
NLP-Project
Project
Project
Details
Activity
Releases
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
almohanad.hafez
NLP-Project
Commits
8fe8e8a4
Commit
8fe8e8a4
authored
Nov 16, 2024
by
Almouhannad Hafez
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
(5) Add results
parent
1d492ae4
Changes
2
Show whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
29 additions
and
14 deletions
+29
-14
README.md
README.md
+29
-14
Results.xlsx
Results.xlsx
+0
-0
No files found.
README.md
View file @
8fe8e8a4
...
...
@@ -90,12 +90,25 @@
-
**Applying data augmentation on the original dataset, added 5 new rephrased rows for each original row using LLM `LLama3`**
![
Augmentation_effect
](
./images/Augmentation_effect.png
)
## ***Part4. Dependency tree***
**Files:**
> **`5.0.Process_texts_stanza.ipynb`**
-
**Applying stanza pipeline containing `tokenize,mwt,pos,lemma,depparse` and storing results for using later**
> **`5.1.Dep_parsing_classifier.ipynb`**
-
**Applying classification task using dep. tree features as tuples, this includes using:**
1.
`(head_word, dependent_word, dependency_relation)`
-
Syntactic inner relations between words in a sentence (Shallow parsing)
1.
`(head_pos, dependent_pos, dependency_relation)`
-
pos: Part Of Speech
1.
`(root_word, "ROOT")`
-
i.e. Head words for sentences
## ***Results***
> ***Using augmented dataset***
| Case
\\
Criterion | Accuracy(Train) | Accuracy(Test) | Precision(Test-Average) | Recall(Test-Average) | F1-Score(Test-Average) | Notes |
| ----------------------- | --------------- | -------------- | ----------------------- | -------------------- | ---------------------- | ------------------------- |
| -----------------------
---------------------------
| --------------- | -------------- | ----------------------- | -------------------- | ---------------------- | ------------------------- |
| nltk stemmer | 0.9629 | 0.9524 | 0.9513 | 0.9522 | 0.9509 | alpha=0.1, 300features |
| nltk lemmatizer | 0.9832 | 0.9699 | 0.9703 | 0.9699 | 0.9696 | alpha=0.1, 700features |
| Stanza lemmatizer | 0.9783 | 0.9671 | 0.9673 | 0.9672 | 0.9668 | alpha=0.1, 550features |
...
...
@@ -108,6 +121,8 @@
| Text + (1,4)Gram | 0.9967 | 0.9807 | 0.9802 | 0.9805 | 0.9802 | alpha=0.01, 12100features |
| Text + (2,3)Gram | 0.9951 | 0.9695 | 0.9688 | 0.9694 | 0.9688 | alpha=0.01, 9100features |
| Text + (2,4)Gram | 0.9951 | 0.9646 | 0.9634 | 0.9645 | 0.9635 | alpha=0.01, 14100features |
| Stanza Dep. Relation tuples | 0.9984 | 0.9781 | 0.9783 | 0.9784 | 0.9781 | alpha=0.01, 7000features |
| Stanza Dep.Relation+POS Relations+Headwords tuples | 0.9981 | 0.9747 | 0.9747 | 0.9749 | 0.9745 | alpha=0.01, 8000features |
---
> ***Applied features selection and model's hyperparameters tuning***
...
...
Results.xlsx
View file @
8fe8e8a4
No preview for this file type
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment