NameValueCollection.ToQueryString performance optimisation #4952

stevejgordon · 2020-08-10T12:44:33Z

This commit optimises the internal ToQueryString extension method on the NameValueCollection type. It avoids some intermediate string allocations by using a zero-alloc buffer to format the final query string.

- Adds a test for the extension method to validate existing behaviour is unaffected by the change of implementation.
- Adds the `System.Memory` package to support `Span<T>` usage across all target frameworks. This is registered conditionally for targets before NetStandard 2.1. I had some downgrade errors when using the latest package version, so I've aligned with the existing `System.Buffers` version for the time being.
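
For readers who want a feel for the buffer-based approach described above, here is a minimal sketch. It is not the code from this PR: the class and method names (`QueryStringSketch`, `ToQueryStringSketch`) are hypothetical, the output format (a leading `?`, pairs joined with `&`, `=` omitted for empty values) is an assumption of the sketch, and it still allocates the escaped strings returned by `Uri.EscapeDataString`. What it illustrates is sizing a pooled buffer from a worst-case bound and creating the final string exactly once.

```csharp
using System;
using System.Buffers;
using System.Collections.Specialized;
using System.Text;

// Hypothetical illustration only; names and output format are assumptions.
public static class QueryStringSketch
{
    public static string ToQueryStringSketch(NameValueCollection nv)
    {
        if (nv == null || nv.Count == 0) return string.Empty;

        // Upper bound: '?' plus, per pair, '&' and '=' and up to
        // 3 chars ("%XX") for every UTF-8 byte of the key and value.
        var maxLength = 1;
        foreach (string key in nv.AllKeys)
        {
            if (key == null) continue; // null keys are skipped in this sketch
            var bytes = Encoding.UTF8.GetByteCount(key)
                + Encoding.UTF8.GetByteCount(nv[key] ?? string.Empty);
            maxLength += 2 + bytes * 3;
        }

        // Rent a reusable buffer instead of building intermediate strings.
        var rented = ArrayPool<char>.Shared.Rent(maxLength);
        try
        {
            Span<char> buffer = rented;
            var position = 0;
            buffer[position++] = '?';

            foreach (string key in nv.AllKeys)
            {
                if (key == null) continue;
                if (position > 1) buffer[position++] = '&';
                position = Append(buffer, position, Uri.EscapeDataString(key));

                var value = nv[key];
                if (!string.IsNullOrEmpty(value))
                {
                    buffer[position++] = '=';
                    position = Append(buffer, position, Uri.EscapeDataString(value));
                }
            }

            // The only string created for the final result.
            return new string(rented, 0, position);
        }
        finally
        {
            ArrayPool<char>.Shared.Return(rented);
        }
    }

    // Copies text into the buffer and returns the new write position.
    private static int Append(Span<char> buffer, int position, string text)
    {
        text.AsSpan().CopyTo(buffer.Slice(position));
        return position + text.Length;
    }
}
```

Under these assumptions, a collection containing `q` = `a b` and `size` = `100` comes out as `?q=a%20b&size=100`. The pooled array is used here because the worst-case bound can be large; a small stack-allocated span would be an alternative for short inputs.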

I've opened this against master, although it may be valuable to backport it to existing supported versions?

The exact performance improvement varies based on the underlying collection, but in all cases the new code allocated less and ran as fast as or faster than the original code.

.NET Core 2.1 Benchmark Results

```csharp
private NameValueCollection _nvc = new NameValueCollection
{
    { "q", "title:\"The Right Way\" AND mod_date:[20020101 TO 20030101]" },
    { "from", "10000" },
    { "request_cache", bool.TrueString },
    { "size", "100" }
};
```

|   Method |     Mean |     Error |    StdDev | Ratio | RatioSD |  Gen 0 | Gen 1 | Gen 2 | Allocated |
|--------- |---------:|----------:|----------:|------:|--------:|-------:|------:|------:|----------:|
| Original | 4.105 us | 0.0901 us | 0.2614 us |  1.00 |    0.00 | 0.3052 |     - |     - |    1288 B |
|      New | 3.652 us | 0.0730 us | 0.1898 us |  0.89 |    0.07 | 0.1755 |     - |     - |     736 B |

Allocations reduced by 42.9%
11% faster

For smaller collections and shorter keys and values, the allocation improvement can be greater. I've seen up to 77% on some test cases where no URI encoding needs to occur.

Closes #4951

elasticmachine · 2020-08-10T12:44:35Z

Since this is a community submitted pull request, a Jenkins build has not been kicked off automatically. Can an Elastic organization member please verify the contents of this patch and then kick off a build manually?

tests/Tests/Extensions/NameValueCollectionExtensionsTests.cs

Mpdreamz

LGTM! I did suggest one more test case covering the encoding of a 4-byte character.

The failing Windows integration test is a request timeout on Tests.Indices.IndexManagement.RolloverIndex.RolloverIndexApiTests and is unrelated.

tests/Tests/Extensions/NameValueCollectionExtensionsTests.cs

stevejgordon · 2020-08-13T08:08:24Z

@Mpdreamz I modified the code slightly to assume the worst case, where every UTF-8 byte is escaped to 3 characters. This avoids overflowing the buffer, even for strings made up entirely of emojis! I also added the suggested test, plus one more for a 2-byte character.

Mpdreamz · 2020-08-13T10:24:06Z

src/Elasticsearch.Net/Extensions/NameValueCollectionExtensions.cs

```diff
-				maxLength += 1 + (key.Length + nv[key]?.Length ?? 0) * 3; // '=' char + worst case assume all key/value chars are escaped
+				var bytes = Encoding.UTF8.GetByteCount(key) + Encoding.UTF8.GetByteCount(nv[key] ?? string.Empty);
+				var maxEncodedSize = bytes * 3; // worst case, assume all bytes are URL escaped to 3 chars
+				maxLength += 1 + maxEncodedSize; // '=' + encoded chars
```


I think the 1-character padding could potentially be 0 when `nv[key]` is empty, since we don't write `=` in that case. Not worth the conditional and added complexity though!
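
To make the reasoning behind the byte-based bound concrete, the sketch below compares the two calculations against what `Uri.EscapeDataString` actually produces. The helper names (`EncodedSizeSketch`, `CharBasedBound`, `ByteBasedBound`) are hypothetical and only mirror the idea in the diff; they are not taken from the PR.

```csharp
using System;
using System.Text;

// Hypothetical helpers that mirror the two bounds discussed in the review.
public static class EncodedSizeSketch
{
    // Earlier idea: three chars for every UTF-16 char in the string.
    public static int CharBasedBound(string s) => s.Length * 3;

    // Revised idea: three chars ("%XX") for every UTF-8 byte.
    public static int ByteBasedBound(string s) => Encoding.UTF8.GetByteCount(s) * 3;

    // Length of the percent-encoded form, used here as the reference.
    public static int EscapedLength(string s) => Uri.EscapeDataString(s).Length;
}
```

For `"\u00E9"` (a 2-byte character in UTF-8) the escaped form is `%C3%A9`, which is 6 chars, while the char-based bound gives only 3. For `"\U0001F600"` (a 4-byte emoji, stored as two UTF-16 chars) the escaped form is `%F0%9F%98%80`, which is 12 chars against a char-based bound of 6. The byte-based bound gives 6 and 12 respectively, so it is never smaller than the encoded length.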

Mpdreamz · 2020-08-13T10:24:40Z

The integration test failures are due to changes on the SNAPSHOT version of the server that we have not addressed yet. Opened #4957 to fix those.

The auto label failure is due to me creating a new label after this PR was opened and applying it to this PR.

Thanks for the updates @stevejgordon!

…tion (#5731) * NameValueCollection.ToQueryString performance optimisation (#4952) * Update package locks * Update license header and remove BOM

Mpdreamz added backport 7.x labels Aug 12, 2020

Mpdreamz approved these changes Aug 12, 2020

PR feedback and safer max length calculation.

f95c007

Mpdreamz reviewed Aug 13, 2020

Mpdreamz merged commit dff2c6a into elastic:master Aug 13, 2020

stevejgordon mentioned this pull request Aug 24, 2020

RequestData.CreatePathWithQueryStrings Performance Optimisation #4980

Closed

stevejgordon deleted the #4951 branch June 16, 2021 12:10

stevejgordon restored the #4951 branch June 16, 2021 12:10

stevejgordon deleted the #4951 branch June 16, 2021 12:10

stevejgordon added a commit that referenced this pull request Jun 16, 2021

NameValueCollection.ToQueryString performance optimisation (#4952)

5246506

stevejgordon mentioned this pull request Jun 16, 2021

NameValueCollection.ToQueryString performance optimisation #5731

Merged

stevejgordon added v7.14.0 and removed v8.0.0-alpha1 labels Mar 24, 2022
