README - OpenGrok cross reference for /third_party/python/Tools/stringbench/README

7db96d56Sopenharmony_cistringbench is a set of performance tests comparing byte string
7db96d56Sopenharmony_cioperations with unicode operations.  The two string implementations
7db96d56Sopenharmony_ciare loosely based on each other and sometimes the algorithm for one is
7db96d56Sopenharmony_cifaster than the other.
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ciThese test set was started at the Need For Speed sprint in Reykjavik
7db96d56Sopenharmony_cito identify which string methods could be sped up quickly and to
7db96d56Sopenharmony_ciidentify obvious places for improvement.
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ciHere is an example of a benchmark
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ci@bench('"Andrew".startswith("A")', 'startswith single character', 1000)
7db96d56Sopenharmony_cidef startswith_single(STR):
7db96d56Sopenharmony_ci    s1 = STR("Andrew")
7db96d56Sopenharmony_ci    s2 = STR("A")
7db96d56Sopenharmony_ci    s1_startswith = s1.startswith
7db96d56Sopenharmony_ci    for x in _RANGE_1000:
7db96d56Sopenharmony_ci        s1_startswith(s2)
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ciThe bench decorator takes three parameters.  The first is a short
7db96d56Sopenharmony_cidescription of how the code works.  In most cases this is Python code
7db96d56Sopenharmony_cisnippet.  It is not the code which is actually run because the real
7db96d56Sopenharmony_cicode is hand-optimized to focus on the method being tested.
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ciThe second parameter is a group title.  All benchmarks with the same
7db96d56Sopenharmony_cigroup title are listed together.  This lets you compare different
7db96d56Sopenharmony_ciimplementations of the same algorithm, such as "t in s"
7db96d56Sopenharmony_civs. "s.find(t)".
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ciThe last is a count.  Each benchmark loops over the algorithm either
7db96d56Sopenharmony_ci100 or 1000 times, depending on the algorithm performance.  The output
7db96d56Sopenharmony_citime is the time per benchmark call so the reader needs a way to know
7db96d56Sopenharmony_cihow to scale the performance.
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ciThese parameters become function attributes.
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ciHere is an example of the output
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ci========== count newlines
7db96d56Sopenharmony_ci38.54   41.60   92.7    ...text.with.2000.newlines.count("\n") (*100)
7db96d56Sopenharmony_ci========== early match, single character
7db96d56Sopenharmony_ci1.14    1.18    96.8    ("A"*1000).find("A") (*1000)
7db96d56Sopenharmony_ci0.44    0.41    105.6   "A" in "A"*1000 (*1000)
7db96d56Sopenharmony_ci1.15    1.17    98.1    ("A"*1000).index("A") (*1000)
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ciThe first column is the run time in milliseconds for byte strings.
7db96d56Sopenharmony_ciThe second is the run time for unicode strings.  The third is a
7db96d56Sopenharmony_cipercentage; byte time / unicode time.  It's the percentage by which
7db96d56Sopenharmony_ciunicode is faster than byte strings.
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ciThe last column contains the code snippet and the repeat count for the
7db96d56Sopenharmony_ciinternal benchmark loop.
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ciThe times are computed with 'timeit.py' which repeats the test more
7db96d56Sopenharmony_ciand more times until the total time takes over 0.2 seconds, returning
7db96d56Sopenharmony_cithe best time for a single iteration.
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ciThe final line of the output is the cumulative time for byte and
7db96d56Sopenharmony_ciunicode strings, and the overall performance of unicode relative to
7db96d56Sopenharmony_cibytes.  For example
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ci4079.83 5432.25 75.1    TOTAL
7db96d56Sopenharmony_ci
7db96d56Sopenharmony_ciHowever, this has no meaning as it evenly weights every test.
7db96d56Sopenharmony_ci