SemanticSegmentationTask: add class-wise metrics #2130

robmarkcole · 2024-06-19T10:17:53Z

Addresses #2121 for segmentation. Mostly copied from @isaaccorley as here - he is additionally passing on_epoch=True which is also adopted here

DimitrisMantas · 2024-06-20T08:39:32Z

Given that most metrics of interest are broken (e.g., all of them when average="macro" and ignore_index is specified (Lightning-AI/torchmetrics#2443) andJaccardIndex which outputs NaN when average==macro instead when you try to take absent and ignored classes into account with zero_division (Lightning-AI/torchmetrics#2535)), should we make an effort to see if and how we could add our own?

I'm saying this because these are only the issues I've found so far, but I've also noticed other suspicious things like the fact that my classwise recall values are not the same as those in the confusion matrix when you normalize it with respect to ground truth (I haven't checked if this is also the case with precision, so when the matrix is normalized column-wise). I'm also pretty confident that if all of this is wrong then micro averaging is also probably wrong.

I should be pretty easy to compute all these metrics straight from the confusion matrix (assuming it at least is correct) and I've actually tried to reimplent them this way but it hasn't really been a priority because I’ve found that all these wrong (?) values are basically a lower bound of the actual ones. If you look at the official implementations, this is actually what they are doing, and my guess is that they have a bug in their logic later on. But indeed all these metrics inherit from StatScores, basically the confusion matrix.

I’m actually pretty dumbfounded these issues are not a top priority for the TorchMetrics team and instead they focus on adding to their docs but to each their own…

robmarkcole · 2024-06-20T08:54:34Z

@DimitrisMantas good call on my ignoring the ignore_index.! In fairness they do address issues, but have a long backlog. When I made some noise they addressed Lightning-AI/torchmetrics#2198
My opinion is it is better to work with torchmetrics to address the issues, rather than implement from scratch here. I see your comment at Lightning-AI/torchmetrics#2535 (comment) so perhaps a pragmatic approach is not to add new metrics that we have concerns about, but also to create specific issues which track these concerns

torchgeo/trainers/segmentation.py

DimitrisMantas · 2024-06-20T09:01:18Z

Sure, that makes sense; please excuse the rant haha.

robmarkcole · 2024-06-20T09:14:33Z

Applied on_epoch=True, to all steps for consistency - this results in both per epoch and per step being reported for train only - perhaps this is why @isaaccorley did not apply to train?

train_loss_epoch | 0.028535427525639534
train_loss_step | 0.00008003244874998927
train_AverageAccuracy_epoch | 0.9101453423500061
train_AverageAccuracy_step | 0.9124529361724854

Note that Val is unaffected:

val_AverageAccuracy | 0.8227439522743225

For a task with 2 classes there are a grand total of Metrics (52) being reported between train & val

isaaccorley · 2024-06-20T12:22:35Z

I just set to be explicit but I think that pytorch lightning or torchmetrics auto sets on_epoch to be False for training and True for all else.

DimitrisMantas · 2024-06-20T12:51:04Z

You need to set both on_step and on_epoch to get logs only per step or per epoch.

robmarkcole · 2024-06-20T13:41:57Z

@DimitrisMantas now just performing on_step for train loss, so a more manageable 36 metrics now

robmarkcole · 2024-06-21T08:29:04Z

Not sure about this failing test ValueError: Problem with given class_path 'torchgeo.trainers.SemanticSegmentationTask'

isaaccorley · 2024-06-21T14:55:05Z

Must be an issue with on of the minimum versions of the package since it's passing for the other tests.

torchgeo/trainers/segmentation.py

adamjstewart · 2024-08-06T11:40:45Z

We can definitely increase the min version of torchmetrics if we need to.

DimitrisMantas · 2024-09-05T13:45:43Z

@robmarkcole I can confirm the recommended approach yields consistent results.

adamjstewart · 2024-10-01T20:17:42Z

Sorry it's taken me so long to review. I was originally hung up on the hack required to support ClasswiseWrapper in log_dict, but I've gotten over that. If torchmetrics wants to make that easier in the future, great. But I also really want this feature, so let's not wait on that.

Only remaining concern is that the code required to loop over all metrics and averages actually makes the code more complicated and difficult to read than avoiding loops entirely. If we want to add new metrics in the future, it looks non-straightforward. I wonder if we can loop over averages only and still keep things simple.

I would also really like to see this done for ClassificationTask too so SemanticSegmentationTask doesn't have a different set of features or metrics.

robmarkcole · 2024-10-02T09:45:44Z

@adamjstewart please see above comment on using _epoch_end

isaaccorley · 2025-01-01T08:28:06Z

I think it's better practice to do on_epoch=True for val and test metrics so that you will get a single value per metric per epoch for the validation and test set. Otherwise you will get a metric per step which is not great for comparing performance across models.

robmarkcole · 2025-01-02T09:40:08Z

To clarify if we want on_step=False, or True for train. I had a look what others do, rslearn do not but clay do

isaaccorley · 2025-01-02T14:35:30Z

That's a good question, I normally set the train to only record steps and not the full epoch average because depending on how many metrics and the size of your train set this can use up a lot of memory. I think for now we can set both as true.

robmarkcole · 2025-01-02T14:37:48Z

From memory I think if using both the loss is named loss_step and loss_epoch, trying to get a model trained to demonstrate. I assume most people will be fine with this, but it could require some explanation

isaaccorley

This LGTM. Any concerns with merging this @adamjstewart?

adamjstewart · 2025-01-03T09:57:14Z

I don't think these concerns have been addressed: #2130 (comment)

isaaccorley · 2025-01-03T15:26:26Z

I agree, I think just making a list of the metrics instead of using loops is probably more readable and easier to add/remove individual metrics based on a users need

robmarkcole · 2025-02-01T17:45:43Z

merging main really messed this branch up, will create a new branch/PR

calebrob6 · 2025-02-17T23:48:01Z

Coming back to this to say I just reimplemented in a private project, would love to have this upstream!

robmarkcole · 2025-02-18T08:58:05Z

Honestly dont know when I will get around to it - the last occasion I had a few hours I ended up debugging dataset issues

adamjstewart · 2025-02-22T10:52:35Z

pyproject.toml

@@ -74,8 +74,8 @@ dependencies = [
    "timm>=0.4.12",
    # torch 1.13+ required by torchvision
    "torch>=1.13",
-    # torchmetrics 0.10+ required for binary/multiclass/multilabel classification metrics
-    "torchmetrics>=0.10",
+    # torchmetrics 1.1.1+ required for average argument to MeanAveragePrecision


According to Lightning-AI/torchmetrics@63c7bbe the argument didn't exist until 1.2

robmarkcole added 3 commits June 19, 2024 09:13

Add average metrics

23fa1fb

Add average metrics

b7d8305

refactor: Rename metrics in SemanticSegmentationTask

b1526fa

github-actions bot added the trainers PyTorch Lightning trainers label Jun 19, 2024

Ruff format

341e272

DimitrisMantas reviewed Jun 20, 2024

View reviewed changes

torchgeo/trainers/segmentation.py Show resolved Hide resolved

Use ignore_index

024feda

robmarkcole added 2 commits June 20, 2024 09:10

pass on_epoch

04cac59

on_epoch to train too

56f20fc

Disable on_step for train metrics

3d2b309

robmarkcole added 2 commits June 20, 2024 14:42

Merge branch 'main' into update-metrics

9af1493

ruff format

192c496

Merge branch 'main' into update-metrics

73b710f

robmarkcole added 2 commits June 23, 2024 06:29

Merge branch 'main' into update-metrics

8ce8c30

Merge branch 'main' into update-metrics

e4ed9fd

robmarkcole commented Jul 2, 2024

View reviewed changes

torchgeo/trainers/segmentation.py Show resolved Hide resolved

robmarkcole added 4 commits July 8, 2024 09:19

Merge branch 'main' into update-metrics

d9c2688

Merge branch 'main' into update-metrics

400fae3

Merge branch 'main' into update-metrics

f4c793e

Merge branch 'main' into update-metrics

3b629ea

calebrob6 previously approved these changes Sep 28, 2024

View reviewed changes

robmarkcole added 2 commits December 31, 2024 09:09

Merge branch 'main' into update-metrics

07e7c4d

Address merge conflicts

ff761f2

Specify on_epoch

59ba3c8

robmarkcole dismissed calebrob6’s stale review via 59ba3c8 January 2, 2025 09:37

Ruff format

1184647

isaaccorley previously approved these changes Jan 3, 2025

View reviewed changes

Merge branch 'main' into update-metrics

82eecdc

robmarkcole dismissed isaaccorley’s stale review via 82eecdc February 1, 2025 15:37

merge main

bcada43

github-actions bot added testing Continuous integration testing datamodules PyTorch Lightning datamodules labels Feb 1, 2025

robmarkcole added 2 commits February 1, 2025 17:43

Merge branch 'microsoft:main' into main

bc3bb3c

merge main

ae7f061

robmarkcole closed this Feb 1, 2025

robmarkcole mentioned this pull request Feb 1, 2025

Update seg and class metrics #2554

Closed

adamjstewart removed this from the 0.7.0 milestone Feb 3, 2025

adamjstewart reviewed Feb 22, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SemanticSegmentationTask: add class-wise metrics #2130

SemanticSegmentationTask: add class-wise metrics #2130

robmarkcole commented Jun 19, 2024 •

edited

Loading

DimitrisMantas commented Jun 20, 2024 •

edited

Loading

robmarkcole commented Jun 20, 2024

DimitrisMantas commented Jun 20, 2024

robmarkcole commented Jun 20, 2024 •

edited

Loading

isaaccorley commented Jun 20, 2024 •

edited

Loading

DimitrisMantas commented Jun 20, 2024

robmarkcole commented Jun 20, 2024

robmarkcole commented Jun 21, 2024

isaaccorley commented Jun 21, 2024

adamjstewart commented Aug 6, 2024

DimitrisMantas commented Sep 5, 2024

adamjstewart commented Oct 1, 2024

robmarkcole commented Oct 2, 2024

isaaccorley commented Jan 1, 2025

robmarkcole commented Jan 2, 2025 •

edited

Loading

isaaccorley commented Jan 2, 2025 •

edited

Loading

robmarkcole commented Jan 2, 2025 •

edited

Loading

isaaccorley left a comment

adamjstewart commented Jan 3, 2025

isaaccorley commented Jan 3, 2025

robmarkcole commented Feb 1, 2025

calebrob6 commented Feb 17, 2025

robmarkcole commented Feb 18, 2025

adamjstewart Feb 22, 2025

SemanticSegmentationTask: add class-wise metrics #2130

SemanticSegmentationTask: add class-wise metrics #2130

Conversation

robmarkcole commented Jun 19, 2024 • edited Loading

DimitrisMantas commented Jun 20, 2024 • edited Loading

robmarkcole commented Jun 20, 2024

DimitrisMantas commented Jun 20, 2024

robmarkcole commented Jun 20, 2024 • edited Loading

isaaccorley commented Jun 20, 2024 • edited Loading

DimitrisMantas commented Jun 20, 2024

robmarkcole commented Jun 20, 2024

robmarkcole commented Jun 21, 2024

isaaccorley commented Jun 21, 2024

adamjstewart commented Aug 6, 2024

DimitrisMantas commented Sep 5, 2024

adamjstewart commented Oct 1, 2024

robmarkcole commented Oct 2, 2024

isaaccorley commented Jan 1, 2025

robmarkcole commented Jan 2, 2025 • edited Loading

isaaccorley commented Jan 2, 2025 • edited Loading

robmarkcole commented Jan 2, 2025 • edited Loading

isaaccorley left a comment

Choose a reason for hiding this comment

adamjstewart commented Jan 3, 2025

isaaccorley commented Jan 3, 2025

robmarkcole commented Feb 1, 2025

calebrob6 commented Feb 17, 2025

robmarkcole commented Feb 18, 2025

adamjstewart Feb 22, 2025

Choose a reason for hiding this comment

robmarkcole commented Jun 19, 2024 •

edited

Loading

DimitrisMantas commented Jun 20, 2024 •

edited

Loading

robmarkcole commented Jun 20, 2024 •

edited

Loading

isaaccorley commented Jun 20, 2024 •

edited

Loading

robmarkcole commented Jan 2, 2025 •

edited

Loading

isaaccorley commented Jan 2, 2025 •

edited

Loading

robmarkcole commented Jan 2, 2025 •

edited

Loading