-
Notifications
You must be signed in to change notification settings - Fork 71
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor language utility data out from utils.py #52
Comments
I do like how this could facilitate the inclusion of new languages in the future as well 🙌 Instead of having to hunt down where all modifications would be needed in code, a lot of it can be centralized in this JSON file. @m-charlton we can absolutely go over what format might work best too, as you've already started the discussion for. Could using a gist though make it easier perhaps to collaborate? To make comments and revisions on the file? |
@wkyoshida thanks for opening this issue. The following gist contains the truncated version of the language data file (only two languages are included). The "used by" & "description" fields are comments,the data proper are held in "languages". My main question concerns the placement of the "ignore" & "remove" fields. Is the following to be preferred? "remove-words": [],
"ignore-words": [] A second question concerns the location of the resources directory, containing the JSON data file: currently it's in |
I'm thinking just having the structure of "remove-words": [],
"ignore-words": [] as you mentioned makes sense actually. Tying them under a |
Makes sense to me, @wkyoshida :) |
refactor(utils.py): move language data to JSON file (resolves #52)
951bf86 sent along some minor changes to fix what we discussed in the sync and also improved the docstrings in a few places. Thanks for the note that one was copied and not updated, @m-charlton! I think the next thing to do here would be #55 so we simplify this process even more. I noted in #54 what parts would be able to be removed :) |
Terms
Issue
This issue is for refactoring the language data out from src/scribe_data/utils.py to a separate file, likely a JSON - as suggested by @m-charlton in #51 🙌
As proposed by @m-charlton,
We can use this issue to track discussion of details for the implementation and the work itself.
The text was updated successfully, but these errors were encountered: