- Notifications
You must be signed in to change notification settings - Fork321
fix: avoid possible job already exists error#751
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.
Already on GitHub?Sign in to your account
Uh oh!
There was an error while loading.Please reload this page.
Conversation
If job create request fails, a query job might still have startedsuccessfully. This commit handles this edge case and returns suchquery job one can be found.
Uh oh!
There was an error while loading.Please reload this page.
Uh oh!
There was an error while loading.Please reload this page.
plamut commentedJul 12, 2021
plamut commentedJul 14, 2021 • edited
Loading Uh oh!
There was an error while loading.Please reload this page.
edited
Uh oh!
There was an error while loading.Please reload this page.
The docs check failure does not seem to be related: Update: Indeed, butthe fix is on its way. |
tseaver commentedJul 14, 2021
googleapis/synthtool#1155 landed here in#762. I'm not sure why the config isn't making you merge with |
🤖 I have created a release \*beep\* \*boop\*---## [2.22.0](https://www.github.com/googleapis/python-bigquery/compare/v2.21.0...v2.22.0) (2021-07-19)### Features* add `LoadJobConfig.projection_fields` to select DATASTORE_BACKUP fields ([#736](https://www.github.com/googleapis/python-bigquery/issues/736)) ([c45a738](https://www.github.com/googleapis/python-bigquery/commit/c45a7380871af3dfbd3c45524cb606c60e1a01d1))* add standard sql table type, update scalar type enums ([#777](https://www.github.com/googleapis/python-bigquery/issues/777)) ([b8b5433](https://www.github.com/googleapis/python-bigquery/commit/b8b5433898ec881f8da1303614780a660d94733a))* add support for more detailed DML stats ([#758](https://www.github.com/googleapis/python-bigquery/issues/758)) ([36fe86f](https://www.github.com/googleapis/python-bigquery/commit/36fe86f41c1a8f46167284f752a6d6bbf886a04b))* add support for user defined Table View Functions ([#724](https://www.github.com/googleapis/python-bigquery/issues/724)) ([8c7b839](https://www.github.com/googleapis/python-bigquery/commit/8c7b839a6ac1491c1c3b6b0e8755f4b70ed72ee3))### Bug Fixes* avoid possible job already exists error ([#751](https://www.github.com/googleapis/python-bigquery/issues/751)) ([45b9308](https://www.github.com/googleapis/python-bigquery/commit/45b93089f5398740413104285cc8acfd5ebc9c08))### Dependencies* allow 2.x versions of `google-api-core`, `google-cloud-core`, `google-resumable-media` ([#770](https://www.github.com/googleapis/python-bigquery/issues/770)) ([87a09fa](https://www.github.com/googleapis/python-bigquery/commit/87a09fa3f2a9ab35728a1ac925f9d5f2e6616c65))### Documentation* add loading data from Firestore backup sample ([#737](https://www.github.com/googleapis/python-bigquery/issues/737)) ([22fd848](https://www.github.com/googleapis/python-bigquery/commit/22fd848cae4af1148040e1faa31dd15a4d674687))---This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).
| raisecreate_exc | ||
| try: | ||
| query_job=self.get_job( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others.Learn more.
Hi, there is a slight problem with this change - self.get_job has a different return type to this function. It can return LoadJob, etc as well as the QueryJob we're expecting so the actual return type doesn't match what is declared for this function.
I don't understand the situations that could result in this code being called, but presumably in reality this would always be a QueryJob? Unfortunately this is causing me problems when running pylint over some code that calls this, because it thinks the function can return LoadJob, and that has a different set of members to QueryJob.
Many thanks,
Andrew Wilkinson
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others.Learn more.
Indeed, in this contextself.get_job() returns aQueryJob, becausejob_id is the same ID that was used a few lines above when constructing a new query job (and then starting it).
This project usespytype for static type checks and it did not complain, but apparentlypylint could not deduce the same and reported a false issue.
Could you tellpylint to ignore return type in that specific line wherequery() is called? IMHO that justifiable, becausepylint is wrong there.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others.Learn more.
Having looked into this a bit further I agree that pylint is wrong. It's a bit of a pain to have disable this check every time we call query, but I think this is a sign that pylint is aging and not keeping up with modern Python's type syntax.
Sorry for the noise.
Cheers,
Andrew
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others.Learn more.
No worries, it was a perfectly valid comment.
Ideally,pylint would allow ignoring particular warnings for lines matching a regex, but I'm not sure if that's currently supported? It would make disabling those false positives much cleaner compared to spamming the# pylint: disable=... comments all around the code.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others.Learn more.
Sadly the error isn't raised on the call toquery, but when you try and use the return value. In my case this is accessingnum_dml_affected_rows, which only exists onQueryJob, and not onLoadJob. Even if it did support disabling errors using a regex, I'm not sure it would be practical to create one.
It's been bugging me why this wouldn't be picked up by the type checker. I think I've tracked it down to the fact thatLoadJob,QueryJob, etc all derive from_AsyncJob, which in turn derives fromgoogle.api_core.future.polling.PollingFuture. The problem is thatgoogle.api_core.future.polling.PollingFuture is not typable, so it gets turns into anAny type, which makes all the job types equivalent and therefore doesn't generate an error. When testing with mypy you have to add# type: ignore to thePollingFuture import line explicitly, but I guesspytype is more forgiving.
I've create the attached file demonstrating the problem (annoyingly github won't let me attach the file as a .py). As currently written it'll generate an error in both mypy and pytype, but swap the comments on lines 5 and 6 and the error goes away.
Anyway, I have a reasonable workaround, so if you want to leave this that's absolutely fine. If in future thepython-api-core library adds typing then I expect this to break though. Adding anassert isinstance(query_job, job.QueryJob) will resolve the issue.
Cheers,
Andrew
invalid_return_union.txt
Fixes#738.
If job create request fails, a query job might still have started successfully. This PR handles this edge case and returns such
query job one can be found.
Based on thesimilar fix in the Java client.
PR checklist: