Question

我有一些不同字符串的文件（从prod中获取大约100.000）。需要找出99％，99.9％的函数来处理该文件中的每个字符串。

我尝试使用 jmh 来编写基准。但是，我能够仅为批处理函数（处理整个文件）或仅针对具有一个特定字符串的所需函数找出所需的百分位数。

public String process1(String str){
    ...process...
}

public String processBatch(List<String> strings){
    for (String str: strings){
        process1(str)
    }
}

另外，我试图通过@param设置整个字符串列表。这使jmh为每个字符串运行了几十次迭代，但没有找到所需的结果。

jmh中有什么可以帮助找到所需的统计数据吗？如果没有，可以使用什么工具？

Answer 1

这是你在找什么？

@Warmup(iterations = 1, time = 5, timeUnit = TimeUnit.SECONDS)
@Measurement(iterations = 1, time = 5, timeUnit = TimeUnit.SECONDS)
@Fork(1)
@State(Scope.Benchmark)
public class MyBenchmark {

    ClassUnderBenchmark classUnderBenchmark = new ClassUnderBenchmark();

    @State(Scope.Benchmark)
    public static class MyTestState {

        int counter = 0;
        List<String> list = Arrays.asList("aaaaa", "bbbb", "ccc");
        String currentString;

        @Setup(Level.Invocation)
        public void init() throws IOException {
            this.currentString = list.get(counter++);
            if (counter == 3) {
                counter = 0;
            }
        }
    }

    @Benchmark
    @Threads(1)
    @BenchmarkMode(Mode.SampleTime)
    public void test(MyBenchmark.MyTestState myTestState) {
        classUnderBenchmark.toUpper(myTestState.currentString);
    }

    public static class ClassUnderBenchmark {

        Random r = new Random();

        public String toUpper(String name) {
            try {
                Thread.sleep(r.nextInt(100));
            } catch (InterruptedException e) {
                e.printStackTrace();
            }
            return name.toUpperCase();
        }
    }

    public static void main(String[] args) throws RunnerException {
        Options opt = new OptionsBuilder()
                .include(MyBenchmark.class.getSimpleName())
                .jvmArgs("-XX:+UseG1GC", "-XX:MaxGCPauseMillis=50")
                .build();
        new Runner(opt).run();
    }
}

请参阅javadoc（org.openjdk.jmh.annotations.Mode）：

/**
 * <p>Sample time: samples the time for each operation.</p>
 *
 * <p>Runs by continuously calling {@link Benchmark} methods,
 * and randomly samples the time needed for the call. This mode automatically adjusts the sampling
 * frequency, but may omit some pauses which missed the sampling measurement. This mode is time-based, and it will
 * run until the iteration time expires.</p>
 */
SampleTime("sample", "Sampling time"),

此测试将为您提供输出：

Result "test":

  N = 91
  mean =      0,056 ±(99.9%) 0,010 s/op

  Histogram, s/op:
    [0,000, 0,010) = 6 
    [0,010, 0,020) = 9 
    [0,020, 0,030) = 3 
    [0,030, 0,040) = 11 
    [0,040, 0,050) = 8 
    [0,050, 0,060) = 11 
    [0,060, 0,070) = 9 
    [0,070, 0,080) = 9 
    [0,080, 0,090) = 14 

  Percentiles, s/op:
      p(0,0000) =      0,003 s/op
     p(50,0000) =      0,059 s/op
     p(90,0000) =      0,092 s/op
     p(95,0000) =      0,095 s/op
     p(99,0000) =      0,100 s/op
     p(99,9000) =      0,100 s/op
     p(99,9900) =      0,100 s/op
     p(99,9990) =      0,100 s/op
     p(99,9999) =      0,100 s/op
    p(100,0000) =      0,100 s/op


Benchmark           Mode  Cnt  Score   Error  Units
MyBenchmark.test  sample   91  0,056 ± 0,010   s/op

java微基准从列表中找到平均值

1 个答案: