Benchmarking (Internal)

This page documents the internal benchmarking tools used to test the performance of py-draughts.

Performance Results

The following benchmarks were run on January 7, 2026.

Performance of the legal moves generator across different board positions:

AlphaBeta engine search performance at different depths:

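The benchmarking system compares performance between different versions of py-draughts in two steps: creating a snapshot of a baseline version, then comparing that snapshot against the current source.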

Create a Snapshot

First, create a snapshot of the version you want to use as baseline:
```
python tools/create_snapshot.py
```
This will:
- Build a wheel file from the current source
- Save it to snapshots/snapshot_YYYYMMDD_HHMMSS/
- Store metadata (git commit, timestamp) in metadata.json
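
The snapshot layout described above can be sketched roughly as follows. This is only an illustration, not the contents of tools/create_snapshot.py: the helper names `snapshot_dir_name` and `write_metadata` and the metadata keys `git_commit` and `timestamp` are assumptions, and the wheel build step is left out.

```python
import json
from datetime import datetime
from pathlib import Path


def snapshot_dir_name(now: datetime) -> str:
    """Return a directory name of the form snapshot_YYYYMMDD_HHMMSS."""
    return now.strftime("snapshot_%Y%m%d_%H%M%S")


def write_metadata(snapshot_dir: Path, git_commit: str, now: datetime) -> Path:
    """Create the snapshot directory and store metadata.json inside it."""
    snapshot_dir.mkdir(parents=True, exist_ok=True)
    metadata_path = snapshot_dir / "metadata.json"
    # Key names are hypothetical; the real script may use different ones.
    metadata = {"git_commit": git_commit, "timestamp": now.isoformat()}
    metadata_path.write_text(json.dumps(metadata, indent=2))
    return metadata_path
```

Because the timestamp is embedded in the directory name, snapshots taken later sort after earlier ones when listed alphabetically.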

Compare Versions

Then compare the snapshot against your current (modified) source:
```
# Compare latest snapshot vs current source
python tools/compare_versions.py

# Compare specific snapshot vs current source
python tools/compare_versions.py snapshots/snapshot_20251231_125057
```
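
When no snapshot path is given, the script has to pick the latest one. How tools/compare_versions.py actually does this is not shown here; one plausible approach, assuming the snapshot_YYYYMMDD_HHMMSS naming scheme and a hypothetical helper `latest_snapshot`, is to take the lexicographically largest directory name:

```python
from pathlib import Path
from typing import Optional


def latest_snapshot(snapshots_root: Path) -> Optional[Path]:
    """Return the newest snapshot directory, or None if there is none."""
    candidates = [p for p in snapshots_root.glob("snapshot_*") if p.is_dir()]
    # snapshot_YYYYMMDD_HHMMSS names sort chronologically as plain strings.
    return max(candidates, key=lambda p: p.name, default=None)
```

Sorting by name rather than by file modification time keeps the result stable even if a snapshot directory is copied or touched later.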
The comparison runs:
- Legal moves benchmark: Measures time to generate legal moves from various positions
- Engine match: Plays games between engines to compare move quality and speed

The benchmark configuration is defined at the top of tools/compare_versions.py:

```
WARMUP_ROUNDS = 5
BENCHMARK_ROUNDS = 10
BENCHMARK_ITERATIONS = 10
ENGINE_DEPTH = 2
NUM_GAMES = 20
```
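
To illustrate how warm-up rounds, benchmark rounds, and iterations typically interact, here is a minimal timing harness. It is a sketch under assumptions: the function `benchmark` is hypothetical, and the way tools/compare_versions.py actually consumes these constants (for example, whether a warm-up round is a single call) may differ.

```python
import time
from typing import Callable, List

WARMUP_ROUNDS = 5
BENCHMARK_ROUNDS = 10
BENCHMARK_ITERATIONS = 10


def benchmark(func: Callable[[], object]) -> List[float]:
    """Return the mean time per call (in seconds) for each measured round."""
    # Warm-up calls are executed but not timed.
    for _ in range(WARMUP_ROUNDS):
        func()

    per_call_times = []
    for _ in range(BENCHMARK_ROUNDS):
        start = time.perf_counter()
        for _ in range(BENCHMARK_ITERATIONS):
            func()
        elapsed = time.perf_counter() - start
        per_call_times.append(elapsed / BENCHMARK_ITERATIONS)
    return per_call_times
```

Timing a batch of iterations per round and dividing by the batch size reduces the influence of timer resolution on very fast calls; the per-round values can then be summarized with a median or minimum.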

Benchmark results are automatically appended to benchmark_results.csv in the project root.
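
Appending to a results file can be sketched as below. The helper `append_result` and any column names passed to it are hypothetical; the actual columns of benchmark_results.csv are not documented on this page.

```python
import csv
from pathlib import Path
from typing import Dict


def append_result(csv_path: Path, row: Dict[str, object]) -> None:
    """Append one row, writing the header only when the file is new or empty."""
    write_header = not csv_path.exists() or csv_path.stat().st_size == 0
    with csv_path.open("a", newline="") as handle:
        writer = csv.DictWriter(handle, fieldnames=list(row.keys()))
        if write_header:
            writer.writeheader()
        writer.writerow(row)
```

This sketch assumes every row uses the same keys in the same order; a file that accumulates results across runs needs a stable column set to stay readable.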

Recommended workflow for improving engine performance:

Profile

Run profiling to identify bottlenecks:

```
# For general engine profiling
python tools/profile_engine_detailed.py

# For legal moves generation specifically
python tools/profile_legal_moves.py
```

Improve

Make code changes to address identified bottlenecks.

Verify Profile

Run the profiling script again to verify local improvements.

Test

Ensure no regressions in functionality:
```
pytest .
```
Benchmark

Compare against the baseline snapshot to verify performance gains:
```
python tools/compare_versions.py
```
Snapshot

If satisfied with the results, create a new snapshot to serve as the next baseline:
```
python tools/create_snapshot.py
```

For detailed profiling of specific functions, use:

```
python tools/profile_engine.py
```

Or for more detailed analysis:

```
python tools/profile_engine_detailed.py 5  # depth 5
```

This uses Python’s cProfile to identify bottlenecks in the engine.
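
For reference, profiling a single call with cProfile from Python code looks roughly like this. The wrapper `profile_call` is a hypothetical helper, not part of the profiling scripts; it uses only the standard library `cProfile` and `pstats` modules.

```python
import cProfile
import io
import pstats


def profile_call(func, *args, top=10):
    """Run func under cProfile and return (result, report text)."""
    profiler = cProfile.Profile()
    profiler.enable()
    try:
        result = func(*args)
    finally:
        profiler.disable()

    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    # Sort by cumulative time so the most expensive call trees come first.
    stats.sort_stats(pstats.SortKey.CUMULATIVE).print_stats(top)
    return result, buffer.getvalue()
```

Sorting by cumulative time highlights the functions whose whole call tree is expensive, which is usually the most useful view when looking for bottlenecks in a recursive search.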

To regenerate the benchmark charts and tables for documentation:

```
python tools/generate_benchmark_charts.py
```

This will:

The generated charts include:

- legal_moves_benchmark.png: Bar chart of legal moves generation time by position type
- engine_benchmark.png: Combined time and nodes chart for engine depths
- engine_depth_time.png: Logarithmic time chart by depth
- engine_depth_nodes.png: Logarithmic nodes searched chart by depth