File Source Benchmarks

Performance benchmarks for file-based data source scenarios using batch processing mode.

Purpose

Test file I/O and parsing performance with pre-generated data files.

Test Scenarios

| Scenario | Description | Validated Features |
|---|---|---|
| parse_to_blackhole | File → Parse → Discard | File reading + pure parsing throughput |
| parse_to_file | File → Parse → File | Complete file-to-file parsing pipeline |
| trans_to_blackhole | File → Parse+Transform → Discard | Parsing + OML transformation throughput |
| trans_to_file | File → Parse+Transform → File | Complete transformation pipeline |

Quick Start

cd benchmark

# Parse to blackhole (default: 20M lines, 6 workers)
./case_file/parse_to_blackhole/run.sh

# Medium dataset (200K lines)
./case_file/parse_to_blackhole/run.sh -m

# Custom configuration
./case_file/parse_to_file/run.sh -w 8 nginx
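
To run every scenario from the table in one pass, a small wrapper along these lines may help. It is only a sketch: the helper name `run_all_scenarios` is hypothetical, and it assumes that `trans_to_blackhole` and `trans_to_file` ship a `run.sh` at the same `./case_file/<scenario>/run.sh` path as the two scenarios shown above. Scenarios without an executable `run.sh` are skipped rather than treated as errors.

```shell
#!/bin/sh
# Scenario names are taken from the Test Scenarios table.
SCENARIOS="parse_to_blackhole parse_to_file trans_to_blackhole trans_to_file"

# run_all_scenarios [ARGS...]
# Runs each scenario's run.sh from the benchmark directory, forwarding ARGS
# unchanged (for example -m). The shared path layout is an assumption.
run_all_scenarios() {
  for scenario in $SCENARIOS; do
    runner="./case_file/${scenario}/run.sh"
    if [ ! -x "$runner" ]; then
      # Report missing runners on stderr and keep going.
      echo "skip: ${runner} not found or not executable" >&2
      continue
    fi
    echo "== ${scenario} =="
    # Stop at the first failing scenario so a broken run is not overlooked.
    "$runner" "$@" || return 1
  done
}
```

After `cd benchmark`, calling `run_all_scenarios -m` would run each available scenario against the medium dataset.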

Data Flow

wpgen → gen.dat → wparse batch → sink (blackhole/file)
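
Because every scenario ends at a sink, a rough end-to-end throughput figure can be estimated from the wall-clock time of a run and the known line count. The helpers below are a sketch only: `lines_per_second` and `time_scenario` are hypothetical names, the timer has one-second resolution, and the measured time includes whatever setup `run.sh` itself performs, so the result is a lower bound rather than a pure parsing rate. The benchmark scripts may already report their own figures, which should be preferred when available.

```shell
#!/bin/sh
# lines_per_second LINES SECONDS
# Prints LINES / SECONDS truncated to an integer. Elapsed times of zero or
# less are treated as one second to avoid dividing by zero.
lines_per_second() {
  awk -v n="$1" -v s="$2" 'BEGIN { if (s <= 0) s = 1; printf "%d\n", n / s }'
}

# time_scenario LINES COMMAND [ARGS...]
# Runs COMMAND, measures elapsed wall-clock seconds, and prints lines/second.
# Any output from COMMAND itself appears before the throughput line.
time_scenario() {
  lines="$1"
  shift
  start=$(date +%s)
  "$@" || return 1
  end=$(date +%s)
  lines_per_second "$lines" "$((end - start))"
}
```

For the medium dataset this could be invoked as `time_scenario 200000 ./case_file/parse_to_blackhole/run.sh -m`, using the 200K line count stated in Quick Start.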
