Yegane: Task 721-724 : DSTC2 and hate_speech18 Dataset #108

nisargpatel58 · 2021-09-15T06:31:58Z

No description provided.

swarooprm · 2021-09-18T22:00:13Z

@SavanDoshi, wondering if you can help review this PR?

swarooprm · 2021-09-22T19:49:51Z

@yeganehkordi, wondering if you can help review this PR?

yeganehkordi · 2021-09-22T19:57:26Z

Sure! I'll review them now.

yeganehkordi

Thanks for your work!
One note: Labels in task 721 look skewed towards the "0" class. Make sure the distribution is not skewed toward a class in all the tasks.

yeganehkordi · 2021-09-22T20:08:51Z

tasks/task721_hate_speech18_classification.json

+        "Nisarg Patel"
+    ],
+    "Source": [
+        "https://huggingface.co/datasets/hate_speech18"


Please add the dataset name in the source field of each task. (in addition to the link)

yeganehkordi · 2021-09-22T20:26:21Z

tasks/task721_hate_speech18_classification.json

+    "Categories": [
+        "Classification"
+    ],
+    "Definition": "Given a statement, determine if the sentiment is of hatred, no hate,relation or nothing at all.",


Please explain the numbers in the input. Anyone should be able to do the task by just reading the definition. You can ask one of your friends to read the definition and solve one of the instances.
Also, consider adding a space after comma in no hate,.

yeganehkordi · 2021-09-22T20:29:22Z

tasks/task721_hate_speech18_classification.json

+        {
+            "input": "Jeeze its worst than the UK.",
+            "output": "3",
+            "explanation": "Its a relative comparision with United Kingdom."


Please correct the typo of the It's, and replace United Kingdom with the United Kingdom. Consider adding a comma before and, in the previous explanation.

yeganehkordi · 2021-09-22T20:48:28Z

tasks/task723_DSTC2_classification.json

+    "Categories": [
+        "Classification"
+    ],
+    "Definition": "Given the customer side of conversation, determine what the goal of the customer is.",


Please elaborate more on the definition. For example, you can say that each part of the conversation is indicated with a "\n" and a number, and all the enquires of the customer should be reflected in the output.

yeganehkordi · 2021-09-22T20:49:18Z

tasks/task723_DSTC2_classification.json

+        {
+            "input": "0.noise\n1.cheap restaurant in the east part of town\n2.phone number\n3.noise\n4.phone number\n5.good bye\n",
+            "output": "You want to find a cheap restaurant and it should be in the east part of town. You want to know the phone number.",
+            "explanation": "All the enquires of the customer with every detail is reflected"


Consider adding a period at the end of the sentence.

yeganehkordi · 2021-09-22T20:54:09Z

tasks/task723_DSTC2_classification.json

+        {
+            "input": "0.traditional food\n1.traditional\n2.spanish food\n3.spanish\n4.food\n5.address\n6.price range\n7.thank you good bye\n",
+            "output": "You want to find and it should serve african food. You don't care about the price range. Make sure you get the address, phone number, and area of the venue.",
+            "explanation": "The food type did not match and the price range was mis calculated."


Please correct the typo of the miscalculated.
Maybe change the explanation of the example to: The defined goals do not match the requirements of the customer.

yeganehkordi · 2021-09-22T21:00:47Z

tasks/task724_DSTC2_classification.json

+    "Categories": [
+        "Classification"
+    ],
+    "Definition": "Given the reponses by restaurant system, determine the experience of the customer understanding as strongly agree, agree,slightly agree,slightly disagree and strongly disagree.",


Please correct the typo of the responses, consider adding a space after commas, and elaborate more on the definition as explained in the previous task.
I suggest merging strongly agree, agree, and slightly agree into agree and the others into disagree because the border between them might be fuzzy. Hence, average humans might not be able to clearly distinguish them.

yeganehkordi · 2021-09-22T21:16:08Z

tasks/README.md

+`task722_DSTC2_classification` | Classify if the speaker is a customer or restaurant system  | Classification  
+`task723_DSTC2_classification` | Determine the customer goals from the given customer side of conversation  | Classification  
+`task724_DSTC2_classification` | Classify the experience of the speaker understanding in terms of strongly agree, agree,slightly agree,slightly disagree and strongly disagree. | Classification  


Maybe change to: Given the responses by a restaurant system, classify the experience of the customer understanding.
Please consider rewriting the summaries in this format.

swarooprm · 2021-09-25T17:02:28Z

@nisargpatel58, can you address reviewer comments so that I can evaluate?

yeganehkordi · 2021-09-30T17:58:10Z

A kind reminder if you have forgotten this task.

swarooprm · 2021-10-07T00:58:00Z

A kind reminder if you have forgotten this task.

I assume you are not planning to revise this PR. @nisargpatel58

yeganehkordi

Thanks! Some remaining comments:

Please change the source fields to something like "Hate Speech Recognition (https://huggingface.co/datasets/hate_speech18)". Since you have used one dataset, the list of sources should have one item.
Labels in task 721 look skewed towards the "0" class. I guess task 724 has the same problem. Make sure the distribution is not skewed toward a class in all the tasks.

yeganehkordi · 2021-10-07T21:34:57Z

tasks/task723_DSTC2_classification.json

        }
    ],
    "Negative Examples": [
        {
            "input": "0.traditional food\n1.traditional\n2.spanish food\n3.spanish\n4.food\n5.address\n6.price range\n7.thank you good bye\n",
            "output": "You want to find and it should serve african food. You don't care about the price range. Make sure you get the address, phone number, and area of the venue.",
-            "explanation": "The food type did not match and the price range was mis calculated."
+            "explanation": "The food type did not match and the price range was miscalculated."
        },
        {
            "input": "0.im looking for something in the west side\n1.doesnt matter as long as it is moderately priced\n2.can i have the address of it\n3.whats its phone number\n4.thank you\n5.goodbye\n",


Maybe change the explanation of this example(second negative example) to: The defined goals do not match the requirements of the customer.

The distribution of the data in both these datasets are skewed in nature. The overall distribution of task 721 falls highly under class 0. It's like 180:20 ( 0 : other classes) kind of distribution. For 724, there is the same thing but I will try to make it a little less skewed.

Thanks! We prefer not to have skewness, even though the data is skewed in nature.

swarooprm · 2021-10-29T03:55:23Z

Last reminder @nisargpatel58 to update this PR.
I will grade tomorrow.

yeganehkordi · 2021-10-29T22:08:16Z

@nisargpatel58 Please check the files. I guess you have pushed the first version of the tasks before addressing the comments.

nisargpatel58 · 2021-10-30T03:45:05Z

I guess just the source field names were not formatted. Rest all changes about skewed data and grammatical errors have been taken care of.

yeganehkordi

I saw that you had addressed these comments once, but now it seems that you have pushed the first version. Here are two examples. Would you please recheck the comments?

yeganehkordi · 2021-10-30T09:25:26Z

tasks/task721_hate_speech18_classification.json

+    "Categories": [
+        "Classification"
+    ],
+    "Definition": "Given a statement, determine if the sentiment is of hatred, no hate,relation or nothing at all.",


Please explain the numbers in the input. Anyone should be able to do the task by just reading the definition. You can ask one of your friends to read the definition and solve one of the instances.
Also, consider adding a space after comma in no hate,.

yeganehkordi · 2021-10-30T09:26:08Z

tasks/task721_hate_speech18_classification.json

+        {
+            "input": "Jeeze its worst than the UK.",
+            "output": "3",
+            "explanation": "Its a relative comparision with United Kingdom."


Please correct the typo of the It's, and replace United Kingdom with the United Kingdom. Consider adding a comma before and, in the previous explanation.

swarooprm · 2021-11-02T11:30:59Z

grading complete

Adding the 4 tasks

3fc1d6c

swarooprm changed the title ~~Task 721-724 : DSTC2 and hate_speech18 Dataset~~ Yegane: Task 721-724 : DSTC2 and hate_speech18 Dataset Sep 22, 2021

yeganehkordi reviewed Sep 22, 2021

View reviewed changes

Made requested changed.

1e611b1

yeganehkordi reviewed Oct 7, 2021

View reviewed changes

Changes in 721, 723 and 724.

5706c8c

Corrected the Source field for all 4 tasks.

aab8ead

yeganehkordi reviewed Oct 30, 2021

View reviewed changes

Changes in 721

2692485

swarooprm mentioned this pull request Nov 2, 2021

Tasks 1745-1756 allenai/natural-instructions#554

Draft

swarooprm closed this Nov 2, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yegane: Task 721-724 : DSTC2 and hate_speech18 Dataset #108

Yegane: Task 721-724 : DSTC2 and hate_speech18 Dataset #108

nisargpatel58 commented Sep 15, 2021

swarooprm commented Sep 18, 2021

swarooprm commented Sep 22, 2021

yeganehkordi commented Sep 22, 2021

yeganehkordi left a comment

yeganehkordi Sep 22, 2021 •

edited

Loading

yeganehkordi Sep 22, 2021

yeganehkordi Sep 22, 2021 •

edited

Loading

yeganehkordi Sep 22, 2021

yeganehkordi Sep 22, 2021

yeganehkordi Sep 22, 2021

yeganehkordi Sep 22, 2021

yeganehkordi Sep 22, 2021

swarooprm commented Sep 25, 2021

yeganehkordi commented Sep 30, 2021

swarooprm commented Oct 7, 2021

yeganehkordi left a comment

yeganehkordi Oct 7, 2021

nisargpatel58 Oct 7, 2021

yeganehkordi Oct 9, 2021

swarooprm commented Oct 29, 2021

yeganehkordi commented Oct 29, 2021

nisargpatel58 commented Oct 30, 2021

yeganehkordi left a comment

yeganehkordi Oct 30, 2021

yeganehkordi Oct 30, 2021

swarooprm commented Nov 2, 2021

Yegane: Task 721-724 : DSTC2 and hate_speech18 Dataset #108

Yegane: Task 721-724 : DSTC2 and hate_speech18 Dataset #108

Conversation

nisargpatel58 commented Sep 15, 2021

swarooprm commented Sep 18, 2021

swarooprm commented Sep 22, 2021

yeganehkordi commented Sep 22, 2021

yeganehkordi left a comment

Choose a reason for hiding this comment

yeganehkordi Sep 22, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yeganehkordi Sep 22, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

swarooprm commented Sep 25, 2021

yeganehkordi commented Sep 30, 2021

swarooprm commented Oct 7, 2021

yeganehkordi left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

swarooprm commented Oct 29, 2021

yeganehkordi commented Oct 29, 2021

nisargpatel58 commented Oct 30, 2021

yeganehkordi left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

swarooprm commented Nov 2, 2021

yeganehkordi Sep 22, 2021 •

edited

Loading

yeganehkordi Sep 22, 2021 •

edited

Loading