Classification Evaluation #17
Conversation
Reviewed 9 of 12 files at r1.
Reviewable status: 9 of 12 files reviewed, 16 unresolved discussions (waiting on @fcogidi and @Negiiiin)
mmlearn/datasets/core/dataset_info.py
line 1 at r1 (raw file):
"""Saving the necessary imformstion about a dataset for evaluation tasks."""
typo: information.
mmlearn/datasets/core/dataset_info.py
line 55 at r1 (raw file):
return self._class_count

def set_class_count(self, count: int) -> None:
Do we really need this setter? I imagine that for a dataset info object we need to set this once at initialization, and it won't change after that. Are there scenarios in which this is still needed?
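If the setter goes away, a minimal sketch of the init-only alternative (the DatasetInfo class name is assumed from the file path, not copied from the PR):

class DatasetInfo:
    def __init__(self, class_count: int) -> None:
        # Set once at construction; no public setter afterwards.
        self._class_count = class_count

    @property
    def class_count(self) -> int:
        """Number of classes in the dataset."""
        return self._class_count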
mmlearn/datasets/core/dataset_info.py
line 80 at r1 (raw file):
return self._label_mapping

def set_label_mapping(self, mapping: Dict[str, Any]) -> None:
Same comment.
mmlearn/datasets/core/dataset_info.py
line 107 at r1 (raw file):
return self._label_embedding

def set_label_embedding(self, embedding: Dict[str, Any]) -> None:
Same comment.
mmlearn/tasks/classification.py
line 52 at r1 (raw file):
Previously, fcogidi (Franklin) wrote…
I think linear probing should be a separate task. It is not really a zero-shot task (the linear classifier needs to be trained), so it shouldn't inherit from EvaluationHooks.
I agree. Zero-shot classification is retrieval, whereas linear probing and regular fine-tuning both add a classification head on top of the encoder output. In linear probing the encoder weights are frozen, while in fine-tuning all weights are updated. We should have one class for zero-shot and another for linear probing and fine-tuning.
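Roughly what I have in mind, as a sketch (ClassifierTuning, embed_dim, and freeze_encoder are names I made up, not the repo's API); zero-shot would stay an EvaluationHooks subclass, while this would be a trainable task:

import torch.nn as nn

class ClassifierTuning(nn.Module):
    """One task covering both linear probing and full fine-tuning."""

    def __init__(self, encoder: nn.Module, embed_dim: int, num_classes: int, freeze_encoder: bool = True) -> None:
        super().__init__()
        self.encoder = encoder
        self.head = nn.Linear(embed_dim, num_classes)
        if freeze_encoder:  # linear probing: only the head is trained
            for param in self.encoder.parameters():
                param.requires_grad = False

    def forward(self, x):
        return self.head(self.encoder(x))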
mmlearn/tasks/classification.py
line 70 at r1 (raw file):
if self.mode == "zeroshot":
    class LabelDescriptionDataset(Dataset):
I'm not sure that defining classes inside conditional statements is good software engineering practice. Keep in mind that on_evaluation_epoch_start is called repeatedly, so this if statement runs multiple times; it doesn't make sense for the class definition to be embedded here.
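For illustration, the class could live at module scope and the hook would only instantiate it (a sketch; the constructor arguments are assumed from the quoted code):

from torch.utils.data import Dataset

class LabelDescriptionDataset(Dataset):
    """Defined once at module level, not inside the hook."""

    def __init__(self, descriptions, tokenizer):
        self.descriptions = descriptions
        self.tokenizer = tokenizer

    def __len__(self):
        return len(self.descriptions)

    def __getitem__(self, idx):
        return self.tokenizer(self.descriptions[idx])

# on_evaluation_epoch_start then only instantiates it:
# if self.mode == "zeroshot":
#     dataset = LabelDescriptionDataset(descriptions, self.tokenizer)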
mmlearn/tasks/classification.py
line 84 at r1 (raw file):
tokens = self.tokenizer(description)
example = Example(
Does this mean that this class only works for image-text data? Can we make the implementation more general so it works for any two modalities?
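One way to generalize, sketched with made-up parameter names (query_modality / target_modality are not in this PR):

class ZeroShotClassification:
    def __init__(self, query_modality: str, target_modality: str, tokenizer) -> None:
        # query_modality="rgb", target_modality="text" reproduces the
        # image-text case; audio-text etc. would work the same way.
        self.query_modality = query_modality
        self.target_modality = target_modality
        self.tokenizer = tokenizer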
mmlearn/tasks/classification.py
line 86 at r1 (raw file):
example = Example(
    {
        Modalities.RGB: torch.rand(3, 224, 224),
Hard-coding values like this is a big NO! Please use configs for such values.
Code quote:
3, 224, 224
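For example, the dummy-input shape could come from a config object instead (a sketch; DummyInputConfig and image_size are hypothetical names):

from dataclasses import dataclass

import torch

@dataclass
class DummyInputConfig:
    # Overridable from the experiment config rather than fixed at the call site.
    image_size: tuple = (3, 224, 224)

def make_dummy_image(cfg: DummyInputConfig) -> torch.Tensor:
    return torch.rand(*cfg.image_size)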
mmlearn/tasks/classification.py
line 110 at r1 (raw file):
for name, dataset_info in all_dataset_info.items():
    descriptions = [
        "This image has a sign of " + label
Hardcoding a prompt like this seems like a very specific type of zero-shot classification. Can we implement a more general class, and perhaps introduce a notion of "prompt" which could take a value like this string?
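Something like a prompt template parameter would keep the class general (prompt_template and build_descriptions are assumptions, not existing code):

class ZeroShotClassification:
    def __init__(self, prompt_template: str = "This image has a sign of {label}") -> None:
        self.prompt_template = prompt_template

    def build_descriptions(self, labels):
        # The prompt comes from config instead of being hard-coded here.
        return [self.prompt_template.format(label=label) for label in labels]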
projects/med_benchmarking/configs/experiment/linear_probing.yaml (discussion resolved)
Reviewed 1 of 18 files at r5, 4 of 19 files at r6, 5 of 15 files at r7, 12 of 37 files at r8.
Reviewable status: 15 of 44 files reviewed, 8 unresolved discussions (waiting on @afkanpour and @Negiiiin)
PR Type
[Feature]
Short Description
Added zero-shot and linear-probing classification evaluation (accuracy, F1-score, and AUC).