Building a pipeline to extract patterns from 11.6GB of Claude Code session data — and discovering the series was using the wrong numbers