Fixed CI/CD Tests for Edge-Endpoint #70

honeytung · 2024-07-03T20:54:11Z

This PR makes the following changes to fix the failing CI/CD tests:

Updates Groundlight SDK version from 0.13.1 to 0.17.0
Updates FrameGrab from 0.4.3 to 0.5.0
Fixes the failing motion-detection tests

As a side note, for future improvements, it would be better to either make sure the edge-endpoint tests also pass whenever we update the SDK, or, if we are not updating the edge-endpoint's SDK version that often, run the SDK tests that match the SDK version the edge-endpoint is using.

brandon-groundlight

The test might just be flaky, but I'd like to see the motion detection test pass, not just skip, before we update the edge-endpoint here

brandon-groundlight · 2024-07-03T21:33:29Z

pyproject.toml

@@ -21,7 +21,7 @@ kubernetes = "^27.2.0"
 jinja2 = "^3.1.2"
 SQLAlchemy = "2.0.22"
 APScheduler = "3.10.4"
-groundlight = "^0.13.1"
+groundlight = "^0.17.0"


This makes me happy

brandon-groundlight · 2024-07-03T21:43:37Z

test/api/test_motdet.py

@@ -240,28 +240,40 @@ def test_motion_detection_not_sufficient_if_doesnt_meet_conf_threshold(gl: Groun
    detector_id = DETECTORS["dog_detector"]["detector_id"]
    detector = gl.get_detector(id=detector_id)

+    # Set detector confidence threshold to 0.90
+    gl.update_detector_confidence_threshold(detector.id, 0.90)


This is just to restore the detector to a known state, because we might have changed it in a previous test run?

Yes. The purpose of this test is to check whether motion detection, even when no motion is detected, will escalate to the cloud once the detector's confidence threshold is raised above the first IQ's confidence.
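
One way to keep the detector in a known state regardless of how a test exits is to wrap the test body in a context manager that resets the threshold on entry and again on exit. The following is only a sketch: `FakeClient` and `known_threshold` are hypothetical names introduced for illustration, and the fake client merely mimics the `update_detector_confidence_threshold(detector_id, threshold)` call seen in the diff by storing values in memory.

```python
from contextlib import contextmanager


class FakeClient:
    """Hypothetical stand-in for the SDK client; stores thresholds in memory only."""

    def __init__(self):
        self.thresholds = {}

    def update_detector_confidence_threshold(self, detector_id, threshold):
        self.thresholds[detector_id] = threshold


@contextmanager
def known_threshold(client, detector_id, baseline=0.90):
    """Reset the threshold to `baseline` before the body runs and again afterwards."""
    client.update_detector_confidence_threshold(detector_id, baseline)
    try:
        yield
    finally:
        # Runs even if the body raises, so a failure or skip cannot leave
        # the detector at a modified threshold.
        client.update_detector_confidence_threshold(detector_id, baseline)


client = FakeClient()
with known_threshold(client, "det_dog"):
    # The test body is free to change the threshold while it runs.
    client.update_detector_confidence_threshold("det_dog", 0.95)
    inside = client.thresholds["det_dog"]
after = client.thresholds["det_dog"]
```

Because the reset sits in a `finally` clause, an early exit from the body would not need its own explicit revert call.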

brandon-groundlight · 2024-07-03T21:49:18Z

test/api/test_motdet.py

+    if new_response.result is None or new_response.result.confidence is None or new_response.result.confidence > 0.95:
+        # Revert the confidence threshold to 0.90
+        gl.update_detector_confidence_threshold(detector.id, 0.90)
+        pytest.skip("This test requires that the cached image query response has a confidence < 0.95")


It looks like this test was skipped when you ran it, so I'm not sure we ever reach the following assertion, which is the meat of what we're testing here. Is there a way to make this more reliable? Is it just a matter of raising the confidence even higher?

Yeah, @tyler-romero and I found out that the test was originally written based on a misunderstanding of the SDK: it assumed that the confidence_threshold is forwarded to the API.

Because the threshold only controls polling on the client side, the call will just time out if the original IQ's confidence is already high enough. Right now, the test using the dog detector already achieves a confidence of 0.98.

If we want this test to work in most scenarios, I can try raising the threshold to 0.99 with a wait time long enough for a cloud labeler to respond.
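
The behavior described above can be illustrated with a small simulation. This is a sketch of client-side polling in general, not the SDK's actual implementation: `wait_for_confident_result` and `cached_result` are hypothetical names, and the server is faked as a function that always returns the same cached answer at 0.98 confidence.

```python
import time


def wait_for_confident_result(fetch_result, confidence_threshold, timeout_sec, poll_interval=0.01):
    """Poll fetch_result() until its confidence meets the threshold or time runs out.

    The threshold is applied only in this client-side loop; it is never sent
    to the (simulated) server, so the server's answer cannot change because of it.
    """
    deadline = time.monotonic() + timeout_sec
    result = fetch_result()
    while result["confidence"] < confidence_threshold and time.monotonic() < deadline:
        time.sleep(poll_interval)
        result = fetch_result()
    return result


def cached_result():
    # Hypothetical server reply: always the same cached answer.
    return {"label": "YES", "confidence": 0.98}


# Threshold below the cached confidence: returns on the first fetch.
quick = wait_for_confident_result(cached_result, confidence_threshold=0.95, timeout_sec=0.1)

# Threshold above the cached confidence: polls until the timeout, then
# returns the same 0.98 answer, since nothing on the server side changed.
start = time.monotonic()
slow = wait_for_confident_result(cached_result, confidence_threshold=0.99, timeout_sec=0.1)
elapsed = time.monotonic() - start
```

In this model, raising the threshold only makes the client wait longer; a different answer arrives only if something else, such as a cloud labeler, updates the result before the timeout.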

honeytung · 2024-07-08T18:31:44Z

After discussion with @tyler-romero: the failing test was written based on a misunderstanding of the SDK implementation, namely that the confidence threshold is forwarded to the API (it is only used for client-side polling). Thus, it is safe to remove this test case.

brandon-groundlight

I'm on board with removing that test too

tyler-romero · 2024-07-08T21:01:42Z

app/core/utils.py

            confidence=confidence,
            label=label,
        ),
+        patience_time=30.0,


Was there a specific reason to add this? It should be taken as an argument to create_iqe, and the true requested patience_time should be set here from post_image_query.

Yeah, I added it while doing some testing and forgot to add it to post_image_query. Will add it back.
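
The suggestion amounts to threading the caller's value through instead of hardcoding it. The sketch below uses heavily simplified, hypothetical signatures: the real create_iqe and post_image_query in this repository take different arguments, and `ImageQuerySketch` is an illustrative stand-in for the actual response model.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ImageQuerySketch:
    """Hypothetical, trimmed-down image query record."""

    detector_id: str
    label: str
    confidence: float
    patience_time: Optional[float]


def create_iqe(detector_id, label, confidence, patience_time=None):
    # Hypothetical signature: patience_time is accepted as an argument
    # rather than being fixed to a constant such as 30.0 inside the helper.
    return ImageQuerySketch(detector_id, label, confidence, patience_time)


def post_image_query(detector_id, patience_time=None):
    # Pretend that edge inference produced this answer.
    label, confidence = "YES", 0.98
    # Forward the value the caller actually requested.
    return create_iqe(detector_id, label, confidence, patience_time=patience_time)


requested = post_image_query("det_dog", patience_time=12.5)
default = post_image_query("det_dog")
```

With this shape, the value stored on the result always reflects what the request asked for, including the case where no patience_time was supplied.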

tyler-romero

LGTM, one minor comment

honeytung added 30 commits June 28, 2024 17:09

Updated sdk version

2cebf22

Updated sdk model

1f5e275

Fixed pydantic checks

b90a327

Removed metadata

6c73ba6

Added pydantic module validation

6b28fcb

Increased confidence target value

2157023

Added debugging text

0513f53

Updated inference threshold

e07cdfc

Updated with new patience_time

d66f149

Added more debugging messages

627849f

Allowed pytest to display logs

86a0caa

Changed the motion detection test so that it checks the confidence when IQ is created on the edge

581f8ff

Increased confidence threshold

59b05d6

Fixed TypeError bug

624bd0b

Reverted test case

eab80f9

Pytest tests

37bd6a9

Updated image

60553f8

Added comments

3a1e41b

Added more information in error messages

0e2fcd5

Updated FrameGrab to 0.5.2

92a5ecb

Added logic to check if the request has timeout

fb53c5e

Added comments

a0f2523

Fixed typo

cd34d62

Increased wait time

1bd778f

Added tests for human review

5c8c0b6

Added confidence logging messages

2b715bc

Added confidence message

6e27aa0

Updated motion detection test

12a31d3

Removed debug logs

84f4840

Fixed confidence logic for pytest skip

0057541

honeytung requested review from tyler-romero and brandon-groundlight July 3, 2024 20:54

Automatically reformatting code with black and isort

8f873e2

brandon-groundlight reviewed Jul 3, 2024

honeytung and others added 13 commits July 3, 2024 17:03

Updated test to use a higher detector confidence threshold

fd8232b

Update confidence threshold to be guarantee cloud escalation

a5db3ad

Refactored code

9ebc22c

Automatically reformatting code with black and isort

ae78c67

Added debug message for _is_confident_enough()

e85b22f

Automatically reformatting code with black and isort

703f65c

Removed threshold

75300e5

Automatically reformatting code with black and isort

945d276

Updated threshold

66cf159

Removed wait time to prevent from escalating to cloud labelers

a6aaf2c

Automatically reformatting code with black and isort

a65561b

Removed test case as it is based on sdk misunderstanding

c40d38c

Automatically reformatting code with black and isort

2d0e32f

brandon-groundlight approved these changes Jul 8, 2024

tyler-romero reviewed Jul 8, 2024

tyler-romero approved these changes Jul 8, 2024

honeytung and others added 2 commits July 8, 2024 15:00

Added patience_time

d4c8720

Automatically reformatting code with black and isort

a94c2cf

honeytung merged commit a506ef5 into main Jul 8, 2024

honeytung deleted the cicd-fix branch July 8, 2024 22:07
