Set a reasonable absolute tolerance to avoid spurious negatives for close-to-zero signals #66

casella · 2024-07-02T11:58:02Z

We had many false regressions in the MSL on signals that are supposed to be zero but aren't exactly zero, due to numerical errors; see modelica/ModelicaStandardLibrary#4421

One solution is to remove those signals outright, but then you can't really make sure that they are close to zero, so that's not a good solution in general, unless those close-to-zero signals are somehow redundant.

@HansOlsson in that ticket suggests a pragmatic but actually quite effective fix for this problem: set the tube width not only based on a relative tolerance, but also on some a-priori nominal value for signals, e.g. 1e-3. This means that the error criterion should be something like:

(v - v_ref)/max(abs(v), nom) < tol

with nom = 1e-3 and tol = 0.002, the current setting for MSL testing.

@beutlich could you please take care of that at your earliest convenience?

Keeping @GallLeo in the loop.

beutlich · 2024-07-16T16:57:20Z

Can you please add the two two-column CSV files where the behavior can be reproduced?

casella · 2024-07-31T09:16:37Z

@beutlich you can find many examples in PR modelica/ModelicaStandardLibrary#4420, where I first wanted to actually remove the offending signals from the CSV files. The idea is that this should not be necessary (i.e. the amended CSV-compare tool should pass the verification without removing those signals) with the change proposed in this ticket.

casella · 2024-07-31T09:26:50Z

This is an MWE of two CSV files that are expected to fail on v2 with the current CSV-compare tool and should instead succeed with the proposed change, nom = 1e-3 and tol = 0.002, if I am not mistaken.

CSV1:

"time", "v1", "v2"
0.0, 0.0, 1e-8 
1.0, 1.0, -2e-8
2.0, 1.5, 1.5e-9
3.0, 1.0, 5e-9
4.0, 0.0, -1e-8

CSV2:

"time", "v1", "v2"
0.0, 0.0002, 3e-8 
1.0, 1.0004, 4e-8
2.0, 1.5008, 2.5e-9
3.0, 1.0002, 1e-9
4.0, 0.0, 2e-8

Thanks!
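As a rough illustration of the example above, the following Python sketch applies a pointwise version of the proposed criterion to the v2 column only. It assumes that CSV1 is the reference and CSV2 the result under test, and it adds an absolute value on the left-hand side; the function name `within_tolerance` is hypothetical. CSV-compare itself builds a tube around the reference curve rather than checking point by point, so this only shows the orders of magnitude involved.

```python
# v2 columns copied from the two CSV files above.
# Assumption: CSV1 is the reference, CSV2 the result under test.
ref_v2 = [1e-8, -2e-8, 1.5e-9, 5e-9, -1e-8]
res_v2 = [3e-8, 4e-8, 2.5e-9, 1e-9, 2e-8]

TOL = 0.002  # relative tolerance currently used for MSL testing
NOM = 1e-3   # proposed a-priori nominal value


def within_tolerance(v, v_ref, tol, nom):
    """Hypothetical pointwise check: |v - v_ref| < tol * max(|v|, nom)."""
    return abs(v - v_ref) < tol * max(abs(v), nom)


# nom = 0.0 reduces the check to a purely relative one.
relative_only = [within_tolerance(v, r, TOL, 0.0) for v, r in zip(res_v2, ref_v2)]
with_nominal = [within_tolerance(v, r, TOL, NOM) for v, r in zip(res_v2, ref_v2)]

print(relative_only)
print(with_nominal)
```

With a purely relative check every v2 sample is rejected, while with nom = 1e-3 the bound becomes 2e-6 and every sample is accepted.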

bilderbuchi · 2024-08-05T13:32:53Z

> set the tube width not only based on a relative tolerance, but also on some a-priori nominal value for signals, e.g. 1e-3. This means that the error criterion should be something like:
> (v - v_ref)/max(abs(v), nom) < tol
> with nom = 1e-3 and tol = 0.002, the current setting for MSL testing.

You saying "should be something like" implies to me that you're not totally sure about the specifics, yourself.

To avoid coming up with some tolerance definition off the cuff (that maybe subtly differs from other similar functions), please let me suggest using the one that Python's math.isclose uses:

abs(v - v_ref) <= max(tol*max(abs(v), abs(v_ref)), nom)

This is the same relation that Julia's isapprox uses.

NB: Numpy uses a slightly different formulation, which I personally don't like because it's not symmetric in its arguments, as tol is scaled with only one of them:

abs(v - v_ref) <= nom + tol*abs(v_ref)
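A short Python sketch of the asymmetry mentioned above. `math.isclose` is the standard-library function; `numpy_style_close` is a hand-written stand-in for the NumPy-style relation (so that the snippet needs no third-party package), and the numbers are made up for illustration.

```python
import math


def numpy_style_close(v, v_ref, rtol, atol=0.0):
    """Hand-written stand-in for the NumPy-style relation quoted above."""
    return abs(v - v_ref) <= atol + rtol * abs(v_ref)


a, b = 100.0, 110.0
rtol = 0.095

# Symmetric: the tolerance scales with the larger magnitude, so order is irrelevant.
sym_ab = math.isclose(a, b, rel_tol=rtol)
sym_ba = math.isclose(b, a, rel_tol=rtol)

# Asymmetric: the tolerance scales with the second argument only.
asym_ab = numpy_style_close(a, b, rtol)  # bound scaled by |b| = 110
asym_ba = numpy_style_close(b, a, rtol)  # bound scaled by |a| = 100

print(sym_ab, sym_ba, asym_ab, asym_ba)
```

Swapping the arguments changes the outcome only for the asymmetric form, which is why it matters which file is treated as the reference there.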

casella · 2024-08-05T16:08:36Z

> You saying "should be something like" implies to me that you're not totally sure about the specifics, yourself.

I meant I didn't put too much effort in making a precise proposal, because of lack of time 😅

In fact, what I wanted to write was

abs(v - v_ref)/max(abs(v), nom) < tol

(of course you need the absolute value on the LHS). This is equivalent to

abs(v - v_ref) < tol*max(abs(v), nom)
abs(v - v_ref) < max(tol*abs(v), tol*nom)

It seems reasonable to me to take the max between abs(v) and abs(v_ref), as suggested by @bilderbuchi. I'm not sure that formula is sensible regarding nom, though; I guess it should be

abs(v - v_ref) <= tol*max(max(abs(v), abs(v_ref)), nom)

as asking for

abs(v - v_ref) <= nom

seems a bit too lax to me.

Ditto for the Numpy implementation, I guess it should be

abs(v - v_ref) <= tol*nom + tol*abs(v_ref)

The idea is that the absolute value of the error should be less than the tolerance times the largest of three values: the variable itself, its reference value, and its nominal value, in order to avoid being too strict in cases where the variable is accidentally close to zero.
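One way to try out the relation proposed above is through Python's `math.isclose`: passing `abs_tol = tol*nom` makes its bound equal to `tol*max(abs(v), abs(v_ref), nom)`. The sketch below is only an illustration with made-up sample values and hypothetical helper names; it is not the CSV-compare implementation.

```python
import math

TOL = 0.002  # relative tolerance
NOM = 1e-3   # assumed nominal value


def close_scaled(v, v_ref):
    """|v - v_ref| <= TOL * max(|v|, |v_ref|, NOM)."""
    return math.isclose(v, v_ref, rel_tol=TOL, abs_tol=TOL * NOM)


def close_unscaled(v, v_ref):
    """Laxer variant that uses NOM itself as the absolute tolerance."""
    return math.isclose(v, v_ref, rel_tol=TOL, abs_tol=NOM)


# Numerical noise around zero is accepted by both variants.
print(close_scaled(3e-8, 1e-8), close_unscaled(3e-8, 1e-8))

# An error of half the nominal value is rejected only by the scaled variant.
print(close_scaled(5e-4, 0.0), close_unscaled(5e-4, 0.0))
```

The second case shows why using nom directly as the absolute tolerance is too lax: a deviation as large as half the nominal value would still pass.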

@bilderbuchi are we on the same page?

casella · 2024-08-05T16:10:03Z

Of course this is based on the assumption that nom is the nominal value of variable v, i.e., its order of magnitude.

bilderbuchi · 2024-08-06T07:12:27Z

> The idea is that the absolute value of the error should be less than the tolerance times the largest of three values: the variable itself, its reference value, and its nominal value, in order to avoid being too strict in cases where the variable is accidentally close to zero.

Yes, you are of course right. I made a sloppy mistake in "translating" the Python docs examples to your nomenclature above, in that I simply replaced atol (absolute tolerance) by nom. I didn't think that through; the replacement should be tol*nom in our situation, as you rightly point out.

casella · 2024-08-07T12:26:14Z

@beutlich do you have a clear idea about how to proceed now?

beutlich · 2024-08-07T20:08:32Z

Not sure if I found the right place to consider nominal values, but this quick-and-dirty change could work:

--- a/Modelica_ResultCompare/CurveCompare/TubeSize.cs
+++ b/Modelica_ResultCompare/CurveCompare/TubeSize.cs
@@ -126,7 +126,7 @@ namespace CurveCompare
         {
             double epsilon = 1e-12;
             baseX = Math.Max(Math.Max(reference.X.Max() - reference.X.Min(), Math.Abs(reference.X.Min())), epsilon);
-            ratio = Math.Max(Math.Max(reference.Y.Max() - reference.Y.Min(), Math.Abs(reference.Y.Min())), epsilon) / baseX;
+            ratio = Math.Max(Math.Max(Math.Max(reference.Y.Max() - reference.Y.Min(), Math.Abs(reference.Y.Min())), epsilon) / baseX, 0.001);
             baseY = baseX * ratio;
             return;
         }
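To see what the patched lines would compute, here is a hedged Python transliteration applied to the v2 column of the example posted earlier in this thread. The function name and the `ratio_floor` parameter are hypothetical, and taking CSV1 as the reference is an assumption. Because the floor is applied to ratio, the resulting baseY is at least 0.001*baseX rather than 0.001.

```python
def former_base_and_ratio(xs, ys, epsilon=1e-12, ratio_floor=0.0):
    """Transliteration of the patched expressions; ratio_floor=0.0 gives the old behavior."""
    base_x = max(max(xs) - min(xs), abs(min(xs)), epsilon)
    ratio = max(max(max(ys) - min(ys), abs(min(ys)), epsilon) / base_x, ratio_floor)
    base_y = base_x * ratio
    return base_x, ratio, base_y


times = [0.0, 1.0, 2.0, 3.0, 4.0]
ref_v2 = [1e-8, -2e-8, 1.5e-9, 5e-9, -1e-8]  # assumed reference column

old = former_base_and_ratio(times, ref_v2)
new = former_base_and_ratio(times, ref_v2, ratio_floor=0.001)

print("old baseY:", old[2])
print("new baseY:", new[2])
```

For this data baseX is 4, so the old baseY is about 3e-8 while the patched one is about 0.004.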

casella · 2024-08-08T08:02:30Z

@beutlich I'm trying to figure out the code you pointed out

csv-compare/Modelica_ResultCompare/CurveCompare/TubeSize.cs

Lines 99 to 132 in 18b309d

    /// <summary>
    /// Calculates standard values for BaseX , BaseY and Ratio.
    /// </summary>
    public void SetStandardBaseAndRatio()
    {
        // set baseX
        baseX = reference.X.Max() - reference.X.Min(); //reference.X.Max() - reference.X.Min() + Math.Abs(reference.X.Min());
        if (baseX == 0) // nonsense case, no data
            baseX = Math.Abs(reference.X.Max());
        if (baseX == 0) // nonsense case, no data
            baseX = 1;
        // set baseY
        baseY = reference.Y.Max() - reference.Y.Min();
        if (baseY == 0) // rare special case
            baseY = Math.Abs(reference.Y.Max());
        if (baseY == 0) // rare special case
            baseY = 0.00000000000000001;
        // set ratio
        if (baseX != 0)
            ratio = baseY / baseX;
        else
            ratio = 0;
    }

    /// <summary>
    /// Calculates former standard values for BaseX , BaseY and Ratio.
    /// </summary>
    public void SetFormerBaseAndRatio()
    {
        double epsilon = 1e-12;
        baseX = Math.Max(Math.Max(reference.X.Max() - reference.X.Min(), Math.Abs(reference.X.Min())), epsilon);
        ratio = Math.Max(Math.Max(reference.Y.Max() - reference.Y.Min(), Math.Abs(reference.Y.Min())), epsilon) / baseX;
        baseY = baseX * ratio;
        return;
    }

but I'm not familiar with it. The first question is: what is the difference between SetStandardBaseAndRatio() and SetFormerBaseAndRatio() ? What do "standard values" and "former standard values" actually mean? Which of the two functions is actually used?

If SetFormerBaseAndRatio() is used, as you suggest, then I reckon we should just set epsilon to 0.001 instead of 1e-12 and keep the rest of the code as it is. This would mean that when the maximum between the range of the reference and its minimum value is a very small number (i.e., numerical noise around zero), we consider a default amplitude of 0.001 for all subsequent elaborations.

Maybe we should also adapt SetStandardBaseAndRatio(), just in case?

beutlich · 2024-08-08T17:40:16Z

I am not sure either what "former" and "standard" refer to. When debugging, I noticed that the "former" function is called.

bilderbuchi · 2024-08-09T06:04:10Z

I'm confused about this part of the code. It seems to basically set the extent (BaseX, BaseY) and aspect ratio of the whole reference curve in question, and does not seem to be involved in the tolerance computation at all. reference.X is an array of x values, of which Min() and Max() presumably find the lowest and highest value.
So, I'm not sure whether this is the right place to edit.

"Former" seems to refer to some formerly used method of computation:

csv-compare/Modelica_ResultCompare/CurveCompare/TubeSize.cs

Line 88 in 18b309d

    /// <param name="formerBaseAndRatio">Base and Ratio are calculated like in the former CSV-Compare, if true;<para>

bilderbuchi · 2024-08-09T06:33:52Z

The point where the tolerance should come into play is at

csv-compare/Modelica_ResultCompare/CurveCompare/Algorithms/Rectangle.cs

Lines 52 to 53 in 18b309d

    report.Lower = CalculateLower(reference, size);
    report.Upper = CalculateUpper(reference, size);

where the lower and upper extent of the tube around the reference are computed.
(or the equivalent function in the Ellipse.cs file, I could not find out which one is ultimately used).

The confusing thing is that in that case the whole extent of the curve values would be used for creating the tube, which would be meaningless. Unfortunately, the code structure is full of indirections and similarly named variables, so it's very hard to find out more just using the Github code viewer.
It could be possible that here, we land in the Options1 branch and compute the tube size not in the standard way (with the above curve extents), but with some arbitrary Value, which I think then would end up being hardcoded to 0.002. AFAICT, this would then put a tube of "width" (in some sense) 0.002 around the reference (irrespective of absolute, relative tolerance of the nearest reference point).

You'd have to analyze the program behaviour and results to find out if that is the case, and ideally add some tests with some constructed expected-pass and expected-fail curves to ensure the expected behaviour. AFAICT, this program does not have a test suite, yet.

That's about as far as I can help, here -- I hope it does!

casella · 2024-08-09T09:30:25Z

I have no idea how this code actually works in detail, but to me the case is quite clear.

The "former" code, which according to @beutlich is the one that is actually executed, computes baseY as

Math.Max(Math.Max(reference.Y.Max() - reference.Y.Min(), Math.Abs(reference.Y.Min())), epsilon)

I'm not sure how exactly this baseY value is used, but it is clear that it ultimately determines the width and/or height of the tolerance tube based on the range of Y values of the reference file.

Here, the range is (most reasonably) defined as max(y_max - y_min, abs(y_min)); this ensures that if you have a signal with a wide span of y values, the span is taken as baseY, while if the signal is (nearly) constant, you take the absolute value of the minimum value. The corner case of a constant-zero signal is handled by a third max with a small epsilon value, to avoid baseY being zero.

So far so good, except that this results in tubes that are too tight when we have variables that are near zero because of balance equations, but are supposed to have a range which is much larger than the actual span of values in the result file.

My argument (Ockham's razor rules!) is that we don't actually need to get into the details of how the tube algorithm works. We only need to make sure that baseY, i.e. the range of y values of the reference, is not excessively small for reference values that happen to be nearly zero (e.g. because of balance equations), but are supposed to have a much wider a-priori range. So, we should take baseY to be the max between the value as computed currently (which is OK in all cases except near-zero variables), and the nominal value.

Since we don't have a nominal attribute in the CSV file, @HansOlsson's pragmatic proposal, which I fully endorse, is to take a value of 0.001, which is small enough to avoid most of the false negatives we get now. The only drawback is that there could be some reference values with nominal values much smaller than this, in which case wrong values could be accepted. So be it, that's much better than removing the near-zero references altogether from the reference files, which would potentially lead to accepting a lot more errors.
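A minimal sketch of that idea in Python, mirroring the baseY expression quoted from SetFormerBaseAndRatio(). The function name `base_y` and its `floor` parameter are hypothetical, the sample columns come from the example CSV1 posted earlier (assumed to be the reference), and this is not the code of the actual change.

```python
def base_y(ys, floor):
    """max(y_max - y_min, |y_min|), but never smaller than floor."""
    return max(max(ys) - min(ys), abs(min(ys)), floor)


ref_v1 = [0.0, 1.0, 1.5, 1.0, 0.0]           # ordinary signal
ref_v2 = [1e-8, -2e-8, 1.5e-9, 5e-9, -1e-8]  # numerical noise around zero

for name, ys in (("v1", ref_v1), ("v2", ref_v2)):
    # First value: floor of 1e-12 (current epsilon); second: floor of 1e-3.
    print(name, base_y(ys, 1e-12), base_y(ys, 1e-3))
```

The larger floor leaves v1 untouched, since its range of 1.5 dominates, and only lifts baseY for the near-zero signal v2.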

I just created PR #67 based on these considerations.

casella · 2024-08-09T09:32:26Z

BTW, if SetStandardBaseAndRatio() is actually not used and is thus dead code, we should probably remove it, or at least write that in the comment.

casella · 2024-09-10T10:48:17Z

Discussion during the MAP-LIB meeting Sep 10: go ahead with #67, get @GallLeo to compile it and use it for the new regression testing, and keep MAP-LIB in the loop.

casella mentioned this issue Sep 10, 2024

Avoid values of baseY less than 0.001 #67

Merged

beutlich linked a pull request Sep 10, 2024 that will close this issue

Avoid values of baseY less than 0.001 #67

Merged

beutlich closed this as completed in #67 Sep 15, 2024
