r.kappa: Fix failures, garbage output, fallback to category values #2573

marisn · 2022-09-09T14:56:51Z

r.kappa has some strange design decisions and produces strange or outright wrong output if the input data don't match its expectations.
This PR:

* prints NA instead of garbage for the first raster category's commission, omission and kappa values;
* prints raster category values when labels are missing in matrix-only output mode.
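The two behaviours listed above can be sketched in a few lines. This is only an illustration in Python (the language of the test suite), not the C code changed in this PR: `header_for`, `format_value` and `commission_percent` are hypothetical names, and the sketch assumes that NA marks a value that cannot be computed.

```python
import math


def header_for(cat, labels):
    """Return the category label, or the category value itself
    when the label is missing or blank (hypothetical helper)."""
    label = labels.get(cat, "")
    return label.strip() or str(cat)


def commission_percent(errors, total):
    """Percentage of commission errors; NaN when there is nothing to divide by."""
    if total == 0:
        return float("nan")
    return 100.0 * errors / total


def format_value(value):
    """Print NA for any undefined value, regardless of which category it belongs to."""
    if value is None or math.isnan(value):
        return "NA"
    return f"{value:.6f}"


labels = {1: "forest", 2: "   "}
headers = [header_for(cat, labels) for cat in (1, 2, 3)]
values = [format_value(commission_percent(e, t)) for e, t in [(0, 0), (1, 4)]]
```

Here the first category gets the same NA treatment as every other category, and categories 2 and 3 fall back to their numeric values because their labels are blank or absent.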

…t present

Although the original code lacks any explanation of why NA should not be printed for the first raster category, I suspect it stems from the idea that the first category is 0 and that, before proper NULL support, 0 was the "no data" value.

raster/r.kappa/prt2csv_mat.c

wenzeslaus

GCC and CodeQL are not happy. Otherwise no idea without testing or tests, sorry.

Probably another candidate for JSON output, but that's out of scope for this PR.

raster/r.kappa/prt2csv_mat.c

marisn · 2022-09-14T19:05:56Z

> GCC and CodeQL are not happy. Otherwise no idea without testing or tests, sorry.

There are more issues, as some values are calculated incorrectly if the input data are not sequential numbers starting at 0. Thus this is just a draft, as more work is needed.
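To illustrate why category values that are not 0, 1, 2, ... need care, here is a minimal Python sketch of Cohen's kappa that keys all totals by category value instead of by array position. It is not the r.kappa implementation; `cohen_kappa` is a hypothetical name, and the inputs are assumed to be two equally long sequences of category values, one per cell.

```python
from collections import Counter


def cohen_kappa(reference, classified):
    """Cohen's kappa for two equally long sequences of category values.

    Totals are looked up by category value, so the categories may be
    any hashable values (for example 10 and 20, or 5 and 300).
    """
    if len(reference) != len(classified):
        raise ValueError("inputs must have the same length")
    n = len(reference)
    if n == 0:
        return float("nan")
    ref_totals = Counter(reference)
    cls_totals = Counter(classified)
    agreements = sum(1 for r, c in zip(reference, classified) if r == c)
    p_observed = agreements / n
    # Chance agreement: product of marginal totals per category.
    p_expected = sum(ref_totals[cat] * cls_totals[cat] for cat in ref_totals) / (n * n)
    if p_expected == 1.0:
        return float("nan")  # kappa is undefined in this edge case
    return (p_observed - p_expected) / (1.0 - p_expected)


kappa_a = cohen_kappa([10, 10, 20, 20], [10, 10, 20, 10])
kappa_b = cohen_kappa([5, 5, 300, 300], [5, 5, 300, 5])
```

Relabelling the categories leaves the result unchanged, which is the property that breaks when code assumes the category value can be used directly as an array index.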

> Probably another candidate for JSON output, but that's out of scope for this PR.

I was thinking of shell-script style output with a "-g" flag, but going for JSON is also an option. In a separate PR, of course, after this PR is merged, as this PR will have to be backported.

These tests should fail as C code fixes will be in upcoming commits.

…gical cases

pii, pi and pj are all arrays of size ncat, so when nstats < ncat the code performed out-of-bounds reads and writes. A resulting segfault was reported in https://trac.osgeo.org/grass/ticket/3978. A test covering this error was already added in the previous commit.

wenzeslaus

The code changes make sense to me in light of the individual commit messages (please include these descriptions in the final commit message; you could add pieces of them as comments too).

I executed the tests locally with the original code, and in the one case where it actually runs and gives results, I get the same values as the ones in the tests, so the outputs that were working are not broken.

Thank you for the extra effort with the correctness test! I was really just thinking of some "current values" check to see the before and after difference (or no difference).

The only thing I noticed is a sub-optimal use of split in the Python tests, which would be nice to fix before the merge.
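The review does not say which use of split is meant. As an assumption only, a common case when parsing tabular tool output is splitting on a literal separator, which leaves empty strings behind. The sketch below contrasts that with `str.splitlines()` and argument-less `str.split()`; `parse_lines` and the sample text are hypothetical and not taken from test_r_kappa.py.

```python
def parse_lines(text):
    """Split text into rows of whitespace-separated fields,
    skipping blank lines (hypothetical helper)."""
    return [line.split() for line in text.splitlines() if line.strip()]


sample = "cat1  10 2\ncat2 3  12\n"

# Splitting on literal separators keeps empty strings for the trailing
# newline and for repeated spaces.
naive = [line.split(" ") for line in sample.split("\n")]

# splitlines() and split() without arguments drop them.
robust = parse_lines(sample)
```

With the second form, assertions in a test can compare fields directly without filtering out empty strings first.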

raster/r.kappa/testsuite/test_r_kappa.py

Selected r.kappa indentation update from OSGeo#2544 (needed for backport r.kappa fixes in OSGeo#2573)

Selected r.kappa indentation update from #2544 (needed for backport of r.kappa fixes in #2573)

…2573)

* use raster category values in matrix output if labels are not present
* print NAs also for the first category

  Although the original code lacks any explanation of why NA should not be printed for the first raster category, I suspect it stems from the idea that the first category is 0 and that, before proper NULL support, 0 was the "no data" value.
* r.kappa: fix incorrect memory access and improve edge case handling

  pii, pi and pj are all arrays of size ncat, so when nstats < ncat the code performed out-of-bounds reads and writes. A resulting segfault was reported in https://trac.osgeo.org/grass/ticket/3978.
* tests for most of functionality


marisn added 2 commits September 9, 2022 14:26

r.kappa: use raster category values in matrix output if labels are no…

814f7ea

…t present

r.kappa: print NAs also for the first category

45af603

Although the original code lacks any explanation of why NA should not be printed for the first raster category, I suspect it stems from the idea that the first category is 0 and that, before proper NULL support, 0 was the "no data" value.

marisn added the raster and C labels Sep 9, 2022

marisn added this to the 8.3.0 milestone Sep 9, 2022

marisn changed the title ~~R kappa fixes~~ r.kappa fixes Sep 9, 2022

github-advanced-security bot found potential problems Sep 9, 2022

raster/r.kappa/prt2csv_mat.c (fixed, resolved)

raster/r.kappa/prt2csv_mat.c (fixed)

r.kappa: use correct formatter for printf

e8777ed

wenzeslaus reviewed Sep 14, 2022

raster/r.kappa/prt2csv_mat.c (fixed, resolved)

marisn added 3 commits September 22, 2022 18:26

r.kappa: tests for most of functionality

e858ecd

These tests should fail as C code fixes will be in upcoming commits.

r.kappa: another test case to catch incorrect memory usage in patholo…

6effd90

…gical cases

marisn marked this pull request as ready for review September 23, 2022 12:44

marisn requested a review from wenzeslaus September 23, 2022 12:44

marisn added the backport_needed label Sep 23, 2022

wenzeslaus approved these changes Oct 5, 2022

raster/r.kappa/testsuite/test_r_kappa.py (outdated, resolved)

raster/r.kappa/testsuite/test_r_kappa.py (outdated, resolved)

wenzeslaus changed the title ~~r.kappa fixes~~ r.kappa: Fix failures, garbage output, fallback to category values Oct 5, 2022

Better Python split use in tests (thanks to wenzeslaus)

5f59a98

marisn merged commit 8dd3ce7 into OSGeo:main Nov 3, 2022

neteler mentioned this pull request Nov 4, 2022

[Bug] r.confusionmatrix testsuite error OSGeo/grass-addons#831

Open

neteler added a commit to neteler/grass that referenced this pull request Nov 10, 2022

r.kappa: r.kappa indentation update from OSGeo#2544

7e1d72b

Selected r.kappa indentation update from OSGeo#2544 (needed for backport r.kappa fixes in OSGeo#2573)

neteler mentioned this pull request Nov 10, 2022

r.kappa: r.kappa indentation update cherry-picked from #2544 #2641

Merged

neteler added a commit that referenced this pull request Nov 10, 2022

r.kappa: r.kappa indentation update from #2544 (#2641)

bd0f6c1

Selected r.kappa indentation update from #2544 (needed for backport of r.kappa fixes in #2573)

neteler modified the milestones: 8.3.0, 8.2.1 Nov 10, 2022

neteler removed the backport_needed label Nov 10, 2022

marisn deleted the r_kappa_fixes branch October 22, 2023 09:57
