Preferences for Zero Shot Classification Result Display #40

Answered by lalitpagaria
cdpierse asked this question in Q&A

Hi everyone,

I'm currently working on an implementation of an explainer for zero-shot classification tasks, as previously discussed in #19.

I find myself at an interesting crossroads with regard to one particular design decision relating to the classification and how to display both the word attributions and the visualization.

For those not familiar with the trick employed by Hugging Face to achieve zero-shot classification: it works by exploiting the "entailment" label of NLI models.

So if we have the sentence:

  • Today apple released the new Macbook showing off a range of new features found in the proprietary silicon chip computer.

And want to classify it with one of the labels:

  • ["sport", "technology", "current affairs"]

This explainer will work much like the zero-shot pipeline in the transformers package - it will test all three labels as hypotheses against the original text and measure which scores highest for entailment.

The hypothesis texts might look something like:

  • [CLS] Today apple released the new Macbook showing off a range of new features found in the proprietary silicon chip computer. [SEP] this text is about sport [SEP]
  • [CLS] Today apple released the new Macbook showing off a range of new features found in the proprietary silicon chip computer. [SEP] this text is about technology [SEP]
  • [CLS] Today apple released the new Macbook showing off a range of new features found in the proprietary silicon chip computer. [SEP] this text is about current affairs [SEP]

In this case technology would score highest.
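
For anyone who wants to see the trick end to end, here is a minimal sketch using the transformers zero-shot pipeline, which builds exactly these premise/hypothesis pairs internally. The model name and hypothesis template below are just illustrative choices, not something this explainer depends on:

```python
# Minimal sketch of the "entailment trick" via the transformers zero-shot pipeline.
# Model choice and hypothesis template are assumptions for illustration only.
from transformers import pipeline

classifier = pipeline(
    "zero-shot-classification",
    model="facebook/bart-large-mnli",  # any NLI model trained for entailment works
)

sequence = (
    "Today apple released the new Macbook showing off a range of new "
    "features found in the proprietary silicon chip computer."
)
labels = ["sport", "technology", "current affairs"]

# Each label is slotted into the hypothesis template, paired with the premise,
# and scored for entailment; the highest-scoring label wins.
result = classifier(
    sequence,
    candidate_labels=labels,
    hypothesis_template="this text is about {}.",
)
print(result["labels"][0])  # expected: "technology"
```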

This brings me to the issue with this new explainer, which stems from the "entailment trick": there are two ways I can represent the classification:

  1. How the model and explainer actually see the text, which will include the hypothesis text.

[Screenshot 2021-05-14 at 18 37 08]

  2. An edited version that includes only the text to be classified; in this case I would also edit the attribution scores to only include the tokens being displayed/returned.

[Screenshot 2021-05-14 at 18 40 32]

Of course I could make both of these options available via an argument of some sort to the method call, but it still leaves me with the decision of which should be the default.
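
To make option 2 a bit more concrete, here is a rough sketch of how the trimming could work, assuming the explainer produces a list of (token, attribution) pairs over the full premise + hypothesis sequence; the helper name and the scores below are made up purely for illustration and are not part of any existing API:

```python
# Hypothetical sketch of option (2): keep only the tokens and scores that
# belong to the original text, i.e. everything before the first [SEP].
from typing import List, Tuple


def trim_to_premise(
    word_attributions: List[Tuple[str, float]],
    sep_token: str = "[SEP]",
) -> List[Tuple[str, float]]:
    """Drop the hypothesis portion so only the classified text is displayed/returned."""
    trimmed = []
    for token, score in word_attributions:
        if token == sep_token:
            break
        trimmed.append((token, score))
    return trimmed


# Example with made-up attribution scores:
full = [
    ("[CLS]", 0.0), ("Today", 0.12), ("apple", 0.31), ("[SEP]", 0.0),
    ("this", 0.02), ("text", 0.01), ("is", 0.0), ("about", 0.03),
    ("technology", 0.45), ("[SEP]", 0.0),
]
print(trim_to_premise(full))  # everything up to (not including) the first [SEP]
```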

I have my own preference but I'd love to hear some thoughts or suggestions on what seems the most natural choice to make here.

Thanks.


Replies: 1 comment 1 reply

lalitpagaria

Thank you Charles for the detailed explanation and for looking into this.
To me [2] looks good, but again it is a personal choice :)

Not related to this task, but I just want to share another repo which shows visualisation in different ways. You might find it interesting: https://github.com/sergioburdisso/pyss3

1 reply
@cdpierse (Maintainer, Author) May 21, 2021

This is my preference too, so I think I will go with this, with the option to pass a parameter that allows the entire sequence to be included. Thanks @lalitpagaria.

Answer selected by cdpierse
Category: Q&A
Labels: None yet
2 participants: @cdpierse, @lalitpagaria
