bpo-22393: Fix multiprocessing.Pool hangs if a worker process dies unexpectedly #10441
Conversation
the-knights-who-say-ni commented Nov 9, 2018
Hello, and thanks for your contribution! I'm a bot set up to make sure that the project can legally accept your contribution by verifying you have signed the PSF contributor agreement (CLA). Our records indicate we have not received your CLA. For legal reasons we need you to sign this before we can look at your contribution. Please follow the steps outlined in the CPython devguide to rectify this issue. If you have recently signed the CLA, please wait at least one business day. You can check yourself to see if the CLA has been received. Thanks again for your contribution; we look forward to reviewing it!
This PR relates to nipy#2700, and should fix the problem underlying nipy#2548. I first considered adding a control thread that monitors the `Pool` of workers, but that would require a large overhead keeping track of PIDs and polling very often. Just adding the core file of [bpo-22393](python/cpython#10441) should fix nipy#2548.
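For context, a watchdog of the kind that was ruled out would look roughly like the hypothetical sketch below; the names (`monitor_pool`, `on_broken`, the polling interval) are invented for illustration, and `pool._pool` is the pool's private list of worker processes, so this is not something the PR proposes:

```python
import threading
import time

def monitor_pool(pool, on_broken, interval=0.5):
    """Hypothetical watchdog thread: poll the pool's worker processes and
    report if any of them disappears. Illustrates the PID bookkeeping and
    constant polling that make this approach costly; not part of the PR."""
    def _poll():
        known_pids = {p.pid for p in pool._pool}
        while True:
            alive_pids = {p.pid for p in pool._pool if p.is_alive()}
            if alive_pids != known_pids:   # a worker died or was replaced
                on_broken()
                return
            time.sleep(interval)           # the polling itself is the overhead
    thread = threading.Thread(target=_poll, daemon=True)
    thread.start()
    return thread
```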
Just a couple comments, pending review from the cpython devs.
Hi @pitrou (or anyone with a say), can you give us a hint about the fate of this PR (even if you honestly think it does not have a very promising future)? Thanks!
Sorry for the delay @oesteban. I've made a couple of comments; you might want to address them.
Also, it seems you'll need to merge/rebase from master and fix any conflicts.
bedevere-bot commented Dec 15, 2018
A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers that would be appreciated. Once you have made the requested changes, please leave a comment on this pull request containing the phrase: `I have made the requested changes; please review again`. And if you don't make the requested changes, you will be poked with soft cushions!
I have made the requested changes; please review again
bedevere-bot commented Dec 18, 2018
Thanks for making the requested changes! @pitrou: please review the changes made to this pull request.
Pinging @pitrou, at least to know whether the changes point in the right direction.
Sorry, will take a look again. Also @pablogsal, you may be interested in this.
Bumping up!
oesteban commented May 22, 2019 • edited
Are there any plans for deprecating multiprocessing? Otherwise, I think this bug should be addressed. If the proposed fix is not the right way of fixing it, please let me know. I'll resolve the conflicts only once I know there is interest in doing so. Thanks very much.
@pierreglaser @tomMoral Would you like to take a look at this?
Yes, I can have a look.
tomMoral commented May 23, 2019 • edited
I'll have a look too.
@pitrou thanks for the prompt response!
Here is a first review. @tomMoral's should land sometime next week :)
class BrokenProcessPool(RuntimeError):
    """
    Raised when a process in a ProcessPoolExecutor terminated abruptly
Maybe avoid using the `ProcessPoolExecutor` and `future` terms, which are objects of the `concurrent.futures` package and not the `multiprocessing` package.
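One possible rewording along those lines (a sketch only, not necessarily the wording the PR ended up with) would describe the failure purely in `multiprocessing` terms:

```python
class BrokenProcessPool(RuntimeError):
    """
    Raised when a worker process of a multiprocessing.Pool terminates
    abruptly while tasks are still pending, leaving the pool unusable.
    """
```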
util.debug('terminate pool entering')
is_broken = BROKEN in (task_handler._state,
                       worker_handler._state,
                       result_handler._state)
worker_handler._state = TERMINATE
No need to use the `_worker_state_lock` here? And in other places where `_worker_handler._state` is manipulated?
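A minimal sketch of what this suggestion seems to imply, assuming `_worker_state_lock` is a plain `threading.Lock` introduced by the PR (exactly where the lock lives, and which accesses it must guard, are assumptions here):

```python
# Sketch only: take the lock around reads/writes of the handlers' _state
# so terminate() and the handler threads cannot race on it.
with worker_handler._worker_state_lock:
    is_broken = BROKEN in (task_handler._state,
                           worker_handler._state,
                           result_handler._state)
    worker_handler._state = TERMINATE
```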
    util.debug('helping task handler/workers to finish')
    cls._help_stuff_finish(inqueue, task_handler, len(pool))
else:
    util.debug('finishing BROKEN process pool')
What happens here if the `task_handler` is blocked, but we do not run `_help_stuff_finish`?
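For reference, the helper in question drains the task queue so that a `task_handler` blocked on a put can finish; in CPython's `Lib/multiprocessing/pool.py` it looks roughly like this (paraphrased, details vary between versions):

```python
@staticmethod
def _help_stuff_finish(inqueue, task_handler, size):
    # task_handler may be blocked trying to put items on inqueue
    util.debug('removing tasks from inqueue until task handler finished')
    inqueue._rlock.acquire()
    while task_handler.is_alive() and inqueue._reader.poll():
        inqueue._reader.recv()
        time.sleep(0)
```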
err = BrokenProcessPool(
    'A worker in the pool terminated abruptly.')
# Exhaust MapResult with errors
This also applies to `ApplyResult`, right?
err = BrokenProcessPool(
    'A worker in the pool terminated abruptly.')
# Exhaust MapResult with errors
for i, cache_ent in list(self._cache.items()):
Out of curiosity, is there any reason why we iterate on a list of `self._cache`?
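For illustration, the exhaustion loop presumably looks something like the sketch below (a hedged reconstruction, not the PR's exact code). The `list(...)` copy matters because `_set()` can delete entries from `self._cache` as results are marked failed, so iterating over the dict directly would mutate it mid-loop:

```python
# Sketch: mark every pending ApplyResult/MapResult as failed so callers
# blocked in .get() raise BrokenProcessPool instead of hanging forever.
err = BrokenProcessPool('A worker in the pool terminated abruptly.')
for i, cache_ent in list(self._cache.items()):  # copy: _set() may delete cache entries
    cache_ent._set(i, (False, err))
```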
There are multiple tests being added that use sleep to synchronize processes (in particular, they assume that the processes will have reached the expected point by the time the sleep finishes). This is very unreliable and it will most certainly fail on the slowest buildbots. Please try to add some synchronization to the tests to make them more deterministic.
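A sketch of the kind of synchronization being asked for, using a hypothetical test that kills a worker (names and structure invented for illustration): the worker signals an `Event` when it has picked up the task, so the test never relies on a fixed sleep.

```python
import multiprocessing

def _wait_then_die(started, release):
    started.set()           # the worker definitely has the task now
    release.wait()          # deterministic hand-off instead of time.sleep()
    raise SystemExit(1)     # simulate the worker dying mid-task

if __name__ == '__main__':
    with multiprocessing.Manager() as mgr:
        started, release = mgr.Event(), mgr.Event()
        with multiprocessing.Pool(1) as pool:
            res = pool.apply_async(_wait_then_die, (started, release))
            started.wait()  # no sleep(): we know the worker is running
            release.set()
            # with the fix, res.get() should raise BrokenProcessPool
            # instead of hanging forever
```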
Note that this PR, while improving the current state of the `Pool`, still does not cover cases like the following:

```python
import sys
from multiprocessing import Pool

pool = Pool(2)
pool.apply(sys.exit, (0,))
```

or at unpickling time:

```python
import sys
from multiprocessing import Pool

class Failure:
    def __reduce__(self):
        return sys.exit, (0,)

pool = Pool(2)
pool.apply(id, (Failure(),))
```

Also, many other problems exist with […]. Maybe a more stable solution would be to actually change the […].
This PR is stale because it has been open for 30 days with no activity.
Closing in favor of #16103.
This PR fixes issue 22393.
Three new unit tests have been added.
https://bugs.python.org/issue22393