Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for QQP dataset with unit tests #1713

Merged
merged 9 commits into from
May 18, 2022
Merged

Conversation

vcm2114
Copy link
Contributor

@vcm2114 vcm2114 commented May 11, 2022

Summary

  • Added support for QQP dataset
  • Added mocked unit tests for QQP dataset

Test

  • pytest test/datasets/test_qqp.py

Context

See #1710

Copy link
Contributor

@parmeet parmeet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

Copy link
Contributor

@Nayef211 Nayef211 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Can we also make sure to add the sharding filter to the dataset as @parmeet mentioned here #1710 (comment).

Btw I think we can land irrespective of the test failures coming from linux (since this is caused by some recent changes related to torchdata). As long as all the tests are passing on mac os.

@vcm2114 vcm2114 merged commit bd0f765 into pytorch:main May 18, 2022
file_name = "quora_duplicate_questions.tsv"
txt_file = os.path.join(base_dir, file_name)
mocked_data = []
print(txt_file)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we remove the print statement here

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Solved with #1734

def setUpClass(cls):
super().setUpClass()
cls.root_dir = cls.get_base_temp_dir()
print(cls.root_dir)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remove print

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Solved with #1734

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants