Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Better exception #355

Merged
merged 3 commits into from
Sep 23, 2022
Merged

Better exception #355

merged 3 commits into from
Sep 23, 2022

Conversation

andamian
Copy link
Contributor

Possible solution for #352

@andamian andamian marked this pull request as draft September 14, 2022 22:03
@codecov
Copy link

codecov bot commented Sep 14, 2022

Codecov Report

Merging #355 (e47c049) into main (907b164) will increase coverage by 0.05%.
The diff coverage is 61.90%.

❗ Current head e47c049 differs from pull request most recent head 2dc0bdf. Consider uploading reports for the commit 2dc0bdf to get more accurate results

@@            Coverage Diff             @@
##             main     #355      +/-   ##
==========================================
+ Coverage   78.36%   78.42%   +0.05%     
==========================================
  Files          46       46              
  Lines        5506     5520      +14     
==========================================
+ Hits         4315     4329      +14     
  Misses       1191     1191              
Impacted Files Coverage Δ
pyvo/dal/adhoc.py 66.19% <33.33%> (-0.40%) ⬇️
pyvo/dal/query.py 84.77% <50.00%> (-0.40%) ⬇️
pyvo/dal/exceptions.py 85.00% <77.77%> (+6.62%) ⬆️
pyvo/dal/tap.py 70.98% <100.00%> (+0.36%) ⬆️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@andamian
Copy link
Contributor Author

Is the solution in this draft acceptable @msdemlei?
It turns the trace reported in the issue into:

Traceback (most recent call last):
  File "/Users/adriand/Documents/work/github/pyvo/pyvo/dal/adhoc.py", line 252, in getdataset
    response.raise_for_status()
  File "/Users/adriand/Documents/work/github/pyvo/.tox/py39-test/lib/python3.9/site-packages/requests/models.py", line 960, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 403 Client Error:  for url: https://almascience.eso.org/dataPortal/member.uid___A001_X14c3_X125d.J1743-0350_ph.spw16.mfs.I.mask.fits.gz

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/Users/adriand/Documents/work/github/pyvo/pyvo/dal/adhoc.py", line 254, in getdataset
    raise DALServiceError.from_except(ex, url)
pyvo.dal.exceptions.DALServiceError: 403 Forbidden: anonymous cannot access member.uid___A001_X14c3_X125d.J1743-0350_ph.spw16.mfs.I.mask.fits.gz for https://almascience.eso.org/dataPortal/member.uid___A001_X14c3_X125d.J1743-0350_ph.spw16.mfs.I.mask.fits.gz

This is just a draft to get an idea if I'm on the right track. It returns a different exception for existing clients that are coded to expect the first one.

I've also improved TAP error messages.

I couldn't find any service to return the payload of the error message in VOTable format. CADC TAP or SIA ones do that but for sync calls only. PyVO hits the async end points even for sync queries (which does not make sense).

@bsipocz
Copy link
Member

bsipocz commented Sep 14, 2022

PyVO hits the async end points even for sync queries (which does not make sense).

smells like a bug?

@andamian
Copy link
Contributor Author

I could very much be the case. I don't understand why in this particular case it was leaking the requests exception when in the other cases it would wrap it in its own exception. This fix uses the simple text payload in the exception message.

@andamian andamian added the bug label Sep 14, 2022
@msdemlei
Copy link
Contributor

msdemlei commented Sep 15, 2022 via email

@tomdonaldson tomdonaldson added this to the v1.4 milestone Sep 16, 2022
Copy link
Contributor

@tomdonaldson tomdonaldson left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, @andamian . Other than my one comment and the change log, I'm happy with this change. I suspect the test coverage will be difficult to improve so I'm not worried about that.

pyvo/dal/tap.py Outdated
@@ -821,7 +821,7 @@ def raise_if_error(self):
if theres an error
"""
if self.phase in {"ERROR", "ABORTED"}:
raise DALQueryError("Query Error", self.phase, self.url)
raise DALQueryError("Query Error: " + self.job.message, self.phase, self.url)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Two thoughts here.

  1. I couldn't convince myself quickly that self.job.message would have a value here. Would it be worth checking for that first?

  2. I have a general concern with the use of the job and phase properties here. My real problem is with how those "properties" are defined, but I don't suggest any changes there for this PR. Those "properties" are both written to do an _update() every time they are accessed, and _update() hits network at the TAP service job URL. So for these two lines, we would do 3 GETs of the same information just to craft an error message. That's not efficient and if the error is related to communication problems with the service, then this makes it worse.

One of the reasons I don't like these properties is that there's no good way to be sure it's safe to use the underscore version (i.e., _job, _phase which don't do _updates()). In this case I think it would be safe to use the underscore versions on all 3 accesses, but it certain to be safe to leave the self.phase in the if and use self._job... and self._phase in the error call since the phase check would have called _update().

Copy link
Contributor Author

@andamian andamian Sep 20, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@tomdonaldson for 1:

tap = pyvo.dal.TAPService("https://ws.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/youcat")
tap.run_async("select top 2 * from caom2.Observations")

Currently, the error is:

  File "/Users/adriand/Documents/work/github/pyvo/pyvo/dal/tap.py", line 824, in raise_if_error
    raise DALQueryError("Query Error ", self.url)
pyvo.dal.exceptions.DALQueryError: Query Error 

The the job error message:

pyvo.dal.exceptions.DALQueryError: Query Error: IllegalArgumentException:Table [ caom2.Observations ] is not found in TapSchema. Possible reasons: table does not exist or permission is denied.

which is an improvement.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this might address #125 too

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No 2. has an issue now #365

@msdemlei
Copy link
Contributor

msdemlei commented Sep 19, 2022 via email

@tomdonaldson
Copy link
Contributor

On Fri, Sep 16, 2022 at 01:40:03PM -0700, Tom wrote: properties here. My real problem is with how those "properties" are defined, but I don't suggest any changes there for this PR. Those "properties" are both written to do an _update() every time they are accessed, and _update() hits network at the TAP service job URL. So for these two lines, we would do 3 GETs of the same information just to craft an error message. That's not efficient and if the error is related to communication problems with the service, then this makes it worse.
Uh. I agree with that. I suppose as a general rule, I'd now say a simple property access should never hit the network, as that's expensive and there's so much that can go wrong, whereas a property access in python looks inoccous. But what's done is done, and we probably shouldn't change the API here, at least not until pyvo 2. I suppose the quick solution here is to write something like: cur_phase = self.phase if phase in {"ERROR", "ABORTED"}: raise DALQueryError("Query Error: " + self.job.message, phase, self.url) That's down to two network accesses. Still not pretty given the server may to totally out of whack, but some progress. Longer-term, I suspect for cases like this it would be nice to have a context manager that would retrieve the job resource once and then return an object exposing them as they were at that time. I'm thinking of something like: with job.get_from_remote() as frozen: if frozen.phase ... If other people also think that's a good idea, I might try my hand on something like this.

I agree that we can't/shouldn't change the underlying behavior now. Longer-term changes should be made carefully, but I don't yet have a sense for the "best" approach. I think a context manager like you suggest would be helpful, but the scope and benefit would be limited to that with. There may be broader rules we could follow based on the state, but it's not obvious where that behavior would live (e.g., if the PHASE is in some completed state, is there ever a reason to ask the server again?) Another question might be, should the API user ever get the choice of forcing or preventing a refresh?

For this limited case, since _update() updates all the job elements, I think it's safe to use self._job in the error generation. Since it's on self it's not a privacy violation, it's just choosing to use the cached value (like being in the context manager, but less explicitly). Based on where raise_if_error() is called, we could probably get away with using _phase as well, but that's hacky and less safe since we can't guarantee where raise_if_error() will be called in the future.

@msdemlei
Copy link
Contributor

msdemlei commented Sep 20, 2022 via email

@andamian andamian marked this pull request as ready for review September 20, 2022 22:24
@andamian
Copy link
Contributor Author

@bsipocz @tomdonaldson - I think this one is ready.

Copy link
Member

@bsipocz bsipocz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, one remaining tiny question that I didn't go into to dig up the answer for myself.

pyvo/dal/exceptions.py Show resolved Hide resolved
@bsipocz
Copy link
Member

bsipocz commented Sep 21, 2022

While this is waiting for another approval, I've rebased it to make sure it passes the remote and doctests.

Copy link
Contributor

@tomdonaldson tomdonaldson left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks great. Thanks @andamian !

@bsipocz bsipocz merged commit 3191766 into astropy:main Sep 23, 2022
@bsipocz
Copy link
Member

bsipocz commented Sep 23, 2022

Thanks @andamian!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants