Diverse Mini-batch Active Learning#134

Open

mbrine555 wants to merge 6 commits into modAL-python:dev from mbrine555:feature/diverse-mini-batch

Conversation

@mbrine555 commented Jun 22, 2021 (edited)

This PR implements a new batch active learning query strategy (as mentioned in #119). Diverse Mini-batch Active Learning takes into account both informativeness and diversity when selecting a batch of new examples to be labeled. It's also worth noting that this bumps the required scikit-learn version from 0.18 to 0.20.

I'm not sure if there's any additional documentation you'd like to have added around this, so just let me know!
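To make the strategy concrete, here is a minimal sketch of the Diverse Mini-batch idea described above (pre-filter by uncertainty, then weighted k-means), not the PR's actual implementation. The function name, signature, and `filter_param` default are illustrative; only the `filter_param * n_instances` pre-filtering step mirrors the diff excerpts below.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import pairwise_distances_argmin

def diverse_mini_batch_query(uncertainty, X, n_instances=10,
                             filter_param=10, random_state=0):
    """Sketch of Diverse Mini-batch Active Learning.

    uncertainty : 1-D array of per-sample informativeness scores
                  (higher = more informative), e.g. margin uncertainty.
    """
    # Step 1: pre-filter to the filter_param * n_instances most
    # uncertain samples to keep the clustering step cheap.
    record_limit = min(filter_param * n_instances, len(X))
    keep_args = np.argpartition(uncertainty, -record_limit)[-record_limit:]
    X_keep = X[keep_args]

    # Step 2: weighted k-means -- uncertainty scores act as sample
    # weights, pulling cluster centers toward informative regions
    # (KMeans.fit accepts sample_weight as of scikit-learn 0.20,
    # hence the version bump mentioned above).
    km = KMeans(n_clusters=n_instances, n_init=10, random_state=random_state)
    km.fit(X_keep, sample_weight=uncertainty[keep_args])

    # Step 3: pick the sample closest to each centroid, yielding a
    # batch that is both informative and diverse.
    closest = pairwise_distances_argmin(km.cluster_centers_, X_keep)
    return keep_args[closest]
```

The pre-filtering step is what makes the method practical: clustering the full pool on every query would dominate the runtime, while clustering only the most uncertain candidates keeps the cost proportional to the batch size.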

    Returns:
        Indices of the instances from `X` chosen to be labelled
    """
    uncertainty = classifier_margin(classifier, X, **uncertainty_measure_kwargs)
@damienlancry (Contributor) commented:

So you only support margin uncertainty? I would suggest adding the uncertainty measure as a callable parameter of the function, defaulting to classifier_margin.
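The reviewer's suggestion could look like the sketch below. The `diverse_batch` signature is hypothetical, and `classifier_margin` here is a simplified stand-in for modAL's margin helper (difference between the two largest predicted class probabilities), not its exact implementation.

```python
import numpy as np

def classifier_margin(classifier, X, **kwargs):
    # Simplified margin: difference between the two largest class
    # probabilities for each sample (sign conventions in modAL may differ).
    proba = classifier.predict_proba(X, **kwargs)
    top2 = -np.partition(-proba, 1, axis=1)  # columns 0, 1 hold the top two
    return top2[:, 0] - top2[:, 1]

def diverse_batch(classifier, X, n_instances=10,
                  uncertainty_measure=classifier_margin,
                  **uncertainty_measure_kwargs):
    # Any callable with a (classifier, X, **kwargs) signature works here,
    # e.g. entropy- or least-confidence-based measures.
    uncertainty = uncertainty_measure(classifier, X,
                                      **uncertainty_measure_kwargs)
    ...  # clustering / selection logic as in the PR
```

This keeps the strategy decoupled from any single uncertainty measure while preserving the current behavior as the default.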


# Limit data set based on n_instances and filter_param
record_limit = filter_param * n_instances
keep_args = np.argsort(uncertainty_scores)[-record_limit:]
@damienlancry (Contributor) commented:

argsort is suboptimal here because we only need to partition around the record_limit-th instance.
argpartition is better suited for that: it is O(n), as opposed to O(n log n) for argsort. You can use multi_argmax or shuffled_argmax, already implemented in selection.py.
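The reviewer's point can be demonstrated directly: `np.argpartition` selects the same top-k set as a full `np.argsort` without fully ordering the array.

```python
import numpy as np

# argpartition finds the top-k indices in O(n); argsort fully orders
# the array in O(n log n). For top-k selection both yield the same set.
scores = np.array([0.1, 0.9, 0.4, 0.7, 0.2, 0.8])
k = 3

top_sorted = np.argsort(scores)[-k:]         # O(n log n), fully ordered
top_part = np.argpartition(scores, -k)[-k:]  # O(n), order within top-k unspecified

assert set(top_sorted) == set(top_part)
```

The trade-off is that argpartition makes no ordering guarantee within the selected block, which is irrelevant when the indices only feed a clustering step as they do here.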

Reviewers

@damienlancry left review comments

Assignees
No one assigned

Labels
None yet

Projects
None yet

Milestone
No milestone

2 participants
@mbrine555, @damienlancry
