Galaxy Stats

Galaxy Stats is a comprehensive ETL pipeline that processes tournament data from galaxy.fun into interactive, self-contained HTML reports. The system uses Python with BeautifulSoup for HTML parsing, NumPy and SciPy for statistical analysis including hierarchical clustering to identify card archetypes, and generates standalone reports with embedded CSS and JavaScript. The 6-stage pipeline includes parse, analyze, finalize, render, copy, and update phases with GitHub Pages deployment.