Yuqi Zheng

Quant Developer & DevOps Engineer Hedge Fund

writing

Low-Latency Trading Machine: A Practical Linux Kernel Tuning Guide
Parsing B3 Exchange Binary UMDF Market Data: From pcap to Order Book
TSE Itayose Auction Pricing: The Five-Condition Algorithm
Cache-Friendly Maps: flat_map and unordered_flat_map vs the STL
Linux rp_filter and Why It Drops Your Trading Packets
Empty Base Optimization (EBO) and [[no_unique_address]]: Zero-Cost Abstraction in Practice
Memory Models, Caches, and Pipelines: The Hardware Behind Concurrency
_GLIBCXX_USE_CXX11_ABI=0: Why the Old ABI Is Slow
Ray Async Internals (5): Compile-Time Thread Isolation in GCS
Replace const vector& with std::span: Zero-Cost Generality
String Split Performance: string_view Is Not Optional
Ray Async Internals (4): Bridging gRPC and Asio
Ray Async Internals (3): Thread Pool and Periodic Timer
Ray Async Internals (2): Event Loop Observability and Chaos Testing
Ray Async Internals (1): Asio's Role and `instrumented_io_context`
Auto-Generating C++ Operators with LibClang
Byte Order Reversal in C++: Bit Twiddling vs. Compiler Builtins
Immediately Invoked Function Expressions (IIFE) in C++
Implementing a Correct C++ Spinlock: TAS, TTAS, and PAUSE
Optimizing a Ring Buffer for Throughput
POSIX Sockets API: Tips and Pitfalls
Ray Internals: The Task, Actor, and Object Model
Split Locks: The Hidden Cost of Cache-Line-Crossing Atomic Operations
Structure-Aware Fuzzing for Floating-Point Code with libFuzzer
Using Huge Pages on Linux
Virtual Memory and Latency: A Practical Guide for Low-Latency Systems
C++ Async Safety: Guarding Object Lifetime with `weak_from_this`
PEP 574: Pickle Protocol 5 and Out-of-Band Data
Python Multiprocessing: Zero-Copy Data Transfer with shared_memory and Protocol 5
C++ Async Callbacks: Lambda Capture and the Destruction Order Fiasco
Compiler Optimization: Loop Unrolling
Compiler Optimization: Loop Style Doesn't Matter
Compiler Optimization: The Cost of Division
Compiler Optimization: Using lea for Integer Arithmetic
Compiler Optimization: Loop-Invariant Code Motion (LICM)
Compiler Optimization: ARM's Barrel Shifter
Compiler Optimization: Partial Inlining
Compiler Optimization: Loop Unswitching
Compiler Optimization: Why xor eax, eax Instead of mov eax, 0
Compiler Optimization: How Pointer Aliasing Kills Performance
Compiler Optimization: Replacing Division with Multiplication
Compiler Optimization: Inlining —The Master Optimization
Compiler Optimization: Tail Call Optimization
Compiler Optimization: Why Floating-Point Resists Vectorization
Compiler Optimization: How Compilers Optimize switch Statements
Compiler Optimization: Induction Variables and Loop-Carried Dependencies
Compiler Optimization: Calling Conventions and Argument Passing
Compiler Optimization: When LICM Fails —The Aliasing Trap
Compiler Optimization: How Compilers Compare Fixed-Length Strings
Compiler Optimization: Multiplying by Constants
Compiler Optimization: Recognizing the popcount Pattern
Compiler Optimization: Clang's Algebraic Simplifications
Compiler Optimization: SIMD and Auto-Vectorization
view all →