Public data,
real findings.
We analyze publicly available data sources to uncover patterns, trends, and findings that matter. Independent research, transparent methodology.
Current WorkCurrent Work
Medicaid Spending Records
Multi-part investigation into U.S. Medicaid provider spending using publicly available CMS data. Statistical anomaly detection, machine learning, and geographic clustering to identify where taxpayer dollars may be at risk.
$530B Under the Microscope (2019–2022)
128M records, 739K providers. Z-score composites, Pareto analysis, and billing pattern anomalies.
View Part I$1.09T Fraud Detection — 34,710 Red Flags (2018–2024)
227M records, 617K providers. Isolation Forest ML, DBSCAN geographic clustering, and 8-dimensional fraud scoring.
View Part IIElecciones Colombia 2026
Citizen participation platform for Colombia's 2026 presidential elections. Advanced statistical modeling with MRP (Multilevel Regression with Poststratification).
Our Approach
Public Data Sources
We work exclusively with publicly available datasets, ensuring transparency and reproducibility in all our analyses.
Statistical Rigor
Advanced modeling including Bayesian inference, MRP, Monte Carlo simulations, and ecological regression for robust findings.
Open Methodology
Our methods are documented and transparent. We publish our approach so others can verify and build upon our work.