Add support for a reducer step that aggregates per-user metrics #76

csgregorian · 2019-02-13T22:57:39Z

The goal of this PR is to let Dynamometer collect and emit per-user metrics, as opposed to the whole-workload measurements that it currently collects using Counters. This is done by first changing the AuditReplayMapper to emit key-value pairs in the form of (username_type, latency) where type is either READ or WRITE. Then, a reducer step AuditReplayReducer is added to sum latencies per key. More generally, this change allows Dynamometer to support arbitrary reducers for stats aggregation along with the existing mappers.

I modified TestWorkloadGenerator to run with and without a reducer, and confirm that the file was created. This still needs a test to ensure that the actual contents of the output file are correct. As well, it's possible that I changed some things that didn't need to be changed in the process of getting the tests to pass/output to appear.

Mostly trying to see if CI will pass, but ready for some preliminary reviews.

EDIT: a couple other things that I'm not sure are the right approach and need to be looked at a bit closer:

Not sure whether extending WorkloadMapper to specify KEYOUT and VALUEOUT as parameters is the right move
What happens if AuditReplayMapper writes to the context without a reducer being specified? Should job.setMapOutputKeyClass be set to LongWritable.class instead of NullWritable.class regardless of whether it's actually outputting anything? It doesn't seem like this caused any errors, but if not, why?
I simplified WorkloadReducer just to what it needs to run, but it should probably have getDescription and getConfigDescriptions as well.

xkrogen

Hey @csgregorian , this is a great start! Besides the inline comments, I have two general comments:

We should add a Combiner to reduce the data transfer volume between Mappers and Reducers.
I'm wondering if it makes sense for the WorkloadReducer to be user-configurable. I'm thinking it may be better for it to be specified by the mapper, like how a WorkloadMapper specifies what input format it expects. Since the output keys/values of the mapper must be exactly matched to the input keys/values of the reducer, I don't think it makes sense to configure them independently. Let me know what you think.

...ometer-workload/src/main/java/com/linkedin/dynamometer/workloadgenerator/WorkloadDriver.java

...meter-workload/src/main/java/com/linkedin/dynamometer/workloadgenerator/WorkloadReducer.java

...kload/src/main/java/com/linkedin/dynamometer/workloadgenerator/audit/AuditReplayReducer.java

...rkload/src/main/java/com/linkedin/dynamometer/workloadgenerator/audit/AuditReplayThread.java

...ad/src/main/java/com/linkedin/dynamometer/workloadgenerator/audit/AuditTextOutputFormat.java

...workload/src/test/java/com/linkedin/dynamometer/workloadgenerator/TestWorkloadGenerator.java

csgregorian · 2019-02-15T21:45:17Z

Simplified a ton of stuff here.

Reducers are local to a WorkloadMapper instead of being user configured
WorkloadMappers handle their own configuration
AuditReplayThreads no longer write directly to context, instead aggregating all of their command latencies locally. These are then collected during the AuditReplayMapper cleanup, similar to the drainCounters behaviour. This obviates the need for a combiner, as the maximum number of inputs to the Reducer is now unique keys * threads per mapper * mappers, instead of commands per thread * threads per mapper * mappers
Changed AuditReplayMapper/Reducer key type to a custom composite writable userCommandKey (not simplified, but the right way to do it [probably])
Extended TestWorkloadGenerator to test the output file contents

Ready for more reviews!

xkrogen

Nice, I love all of the changes. I have mostly minor comments throughout, and my only general comment would be that we should update TestDynamometerInfra in some way. We're already asserting on the output within TestWorkloadGenerator, but maybe in the infra test we can at least assert that the -workload_output_path arg works as expected and puts the file in the right place?

...eter-workload/src/main/java/com/linkedin/dynamometer/workloadgenerator/CreateFileMapper.java

...kload/src/main/java/com/linkedin/dynamometer/workloadgenerator/audit/AuditReplayReducer.java

...rkload/src/main/java/com/linkedin/dynamometer/workloadgenerator/audit/AuditReplayThread.java

...-workload/src/main/java/com/linkedin/dynamometer/workloadgenerator/audit/UserCommandKey.java

...kload/src/main/java/com/linkedin/dynamometer/workloadgenerator/audit/AuditReplayReducer.java

...-workload/src/main/java/com/linkedin/dynamometer/workloadgenerator/audit/UserCommandKey.java

...workload/src/test/java/com/linkedin/dynamometer/workloadgenerator/TestWorkloadGenerator.java

xkrogen

Few minor style nits and then it's all good to go!

...workload/src/test/java/com/linkedin/dynamometer/workloadgenerator/TestWorkloadGenerator.java

...-workload/src/main/java/com/linkedin/dynamometer/workloadgenerator/audit/UserCommandKey.java

...kload/src/main/java/com/linkedin/dynamometer/workloadgenerator/audit/AuditReplayReducer.java

Christopher Gregorian added 2 commits February 13, 2019 12:03

Reducer support and per-user aggregation

77ed60a

Clean up

3bc3998

xkrogen requested changes Feb 14, 2019

View reviewed changes

Refactor and clean up

67873d7

csgregorian force-pushed the cgregori-per-user-metrics branch from 757d342 to 67873d7 Compare February 14, 2019 23:38

Christopher Gregorian added 3 commits February 14, 2019 15:54

tests pass!

4c84065

Fix up exception handling

e5da741

Add UserCommandKey type

4ddcc07

csgregorian force-pushed the cgregori-per-user-metrics branch from 9b61ead to dc3e754 Compare February 15, 2019 18:15

More clean up

75de589

csgregorian force-pushed the cgregori-per-user-metrics branch from dc3e754 to 75de589 Compare February 15, 2019 18:15

Add output check to unit test

21b6566

xkrogen requested changes Feb 15, 2019

View reviewed changes

Last fixes and javadoc

6538122

xkrogen reviewed Feb 15, 2019

View reviewed changes

Style

500b5cf

xkrogen approved these changes Feb 16, 2019

View reviewed changes

xkrogen merged commit bfbfb56 into linkedin:master Feb 16, 2019

tasanuma mentioned this pull request Oct 30, 2019

HDFS-14824. [Dynamometer] Dynamometer in org.apache.hadoop.tools does not output the benchmark results. apache/hadoop#1685

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for a reducer step that aggregates per-user metrics #76

Add support for a reducer step that aggregates per-user metrics #76

csgregorian commented Feb 13, 2019 •

edited

Loading

xkrogen left a comment

csgregorian commented Feb 15, 2019 •

edited

Loading

xkrogen left a comment

xkrogen left a comment

Add support for a reducer step that aggregates per-user metrics #76

Add support for a reducer step that aggregates per-user metrics #76

Conversation

csgregorian commented Feb 13, 2019 • edited Loading

xkrogen left a comment

Choose a reason for hiding this comment

csgregorian commented Feb 15, 2019 • edited Loading

xkrogen left a comment

Choose a reason for hiding this comment

xkrogen left a comment

Choose a reason for hiding this comment

csgregorian commented Feb 13, 2019 •

edited

Loading

csgregorian commented Feb 15, 2019 •

edited

Loading