-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Inferencing of Features in FeatureView and timestamp column of DataSource #1523
Inferencing of Features in FeatureView and timestamp column of DataSource #1523
Conversation
Hi @mavysavydav. Thanks for your PR. I'm waiting for a feast-dev member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
…More through testing needed. Signed-off-by: David Liu <[email protected]>
b397c1a
to
df1e9a3
Compare
/ok-to-test |
Thanks @mavysavydav, we will have a look. |
Seems reasonable. |
Signed-off-by: David Liu <[email protected]>
aba7fea
to
e59c561
Compare
Signed-off-by: David Liu <[email protected]>
About to push up new commit with test cases and some other minor fixes @jklegar. Feel free to wait on review until that commit arrives |
Signed-off-by: David Liu <[email protected]>
Signed-off-by: David Liu <[email protected]>
…ion test & added __ rule in inference Signed-off-by: David Liu <[email protected]>
Ok, tests have been added and ball is in ur court :D 🏀 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you David this is great! Left a few comments, most of them small - let me know if anything doesn't make sense
… test case for it Signed-off-by: David Liu <[email protected]>
d097efe
to
01fd199
Compare
thx for review! i agree w ur suggestions and have made the changes (including adding a test for checking that bigQuerySource is properly inferencing based on |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is great - a couple small followup comments and then should be good to go
sdk/python/feast/data_source.py
Outdated
list(zip(df["COLUMN_NAME"].to_list(), df["DATA_TYPE"].to_list())) | ||
) | ||
else: | ||
bq_columns_query = f"SELECT * FROM {self.query} LIMIT 1" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we may need parentheses around {self.query} here
df, event_timestamp_column | ||
) | ||
return BigQuerySource( | ||
query=bq_source_using_table_ref.table_ref, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The query is supposed to be a select statement, so I think something like f"SELECT * FROM {bq_source_using_table_ref.table_ref}" would be better
expected = { | ||
("float_col", ValueType.DOUBLE), | ||
("int64_col", ValueType.INT64), | ||
("string_col", ValueType.STRING), | ||
} | ||
expected2 = { # parsing with the "query" param in bq sources uses different code path |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we want both code paths to end up producing the same types, see also comment on types above
sdk/python/feast/type_map.py
Outdated
"INT64": ValueType.INT64, | ||
"STRING": ValueType.STRING, | ||
"FLOAT": ValueType.FLOAT, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it's pretty weird on BigQuery's part that it's giving different types based on whether you query the INFORMATION_SCHEMA table or look at the field_type on schema_field; however, it seems INTEGER and INT64 are equivalent, and same for FLOAT and FLOAT64 - see https://cloud.google.com/bigquery/docs/reference/rest/v2/tables#TableFieldSchema.FIELDS.type.
So I think that here you can just have "INTEGER" map to the int64 value type and "FLOAT" map to the double value type, and this should also fix the mapping to different types in the test below
Signed-off-by: David Y Liu <[email protected]>
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: jklegar, mavysavydav The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/kind feature |
/lgtm |
…urce (feast-dev#1523) * Implemented the inferencing. Did cursory runs to make sure it works. More through testing needed. Signed-off-by: David Liu <[email protected]> * fixed issue with mutable default argument in FeatureView Signed-off-by: David Liu <[email protected]> * Fix in example_feature_repo_with_inference.py file Signed-off-by: David Liu <[email protected]> * Added test cases and small fixes. Signed-off-by: David Liu <[email protected]> * fixed missing import with handling for lint error Signed-off-by: David Liu <[email protected]> * marked a test that needs bigquery client requesting to be an integration test & added __ rule in inference Signed-off-by: David Liu <[email protected]> * Code review corrections + BQSource Query arg handling + corresponding test case for it Signed-off-by: David Liu <[email protected]> * CR corrections Signed-off-by: David Y Liu <[email protected]> Co-authored-by: David Liu <[email protected]>
…urce (#1523) * Implemented the inferencing. Did cursory runs to make sure it works. More through testing needed. Signed-off-by: David Liu <[email protected]> * fixed issue with mutable default argument in FeatureView Signed-off-by: David Liu <[email protected]> * Fix in example_feature_repo_with_inference.py file Signed-off-by: David Liu <[email protected]> * Added test cases and small fixes. Signed-off-by: David Liu <[email protected]> * fixed missing import with handling for lint error Signed-off-by: David Liu <[email protected]> * marked a test that needs bigquery client requesting to be an integration test & added __ rule in inference Signed-off-by: David Liu <[email protected]> * Code review corrections + BQSource Query arg handling + corresponding test case for it Signed-off-by: David Liu <[email protected]> * CR corrections Signed-off-by: David Y Liu <[email protected]> Co-authored-by: David Liu <[email protected]>
Test cases written and also tested by running with CLI.
What this PR does / why we need it:
Milestone 1 of https://docs.google.com/document/d/1MkWvexE4e5nYWcQLELFnJ5o9OlJDKC2rn_USHMDT9dg/edit
Which issue(s) this PR fixes:
Fixes #1500
Does this PR introduce a user-facing change?: