Projects
fantasy football
https://github.com/joeeoj/fantasy-football (nbviewer link to render the Altair graphs)
This was a project to learn NFL fantasy football strategy and practice data wrangling and analysis. It gave me a chance to try out Altair and test out mkdocs for writing project documentation (using this great tutorial by calmcode).
2021/2022 results: 3rd place – not bad considering I got second to last the year before (I had no idea what I was doing that year). Like most people, my biggest area for improvement is evaluating and making trades.
dolthub data bounties
Dolt is Git + MySQL. The company that created it, Dolthub, hosts data bounties to incentivize contribution to datasets. It has given me an opportunity to practice scraping websites with Python, pandas data manipulation, and SQL. Here are the data bounties I’ve contributed too:
recipes
http://whatsfordinner.recipes/
A static website hosted by Netlify with my favorite recipes and some editorial notes. Great for last minute dinner planning. All the credit goes to jeffThompson as this is based on his original Recipes website with modifications by kvpsky. This falls well within the free tier of Netlify so the only cost was the domain for about $10. Adding new recipes is as simple as adding markdown pages, regenerating the html, and pushing changes to main (repo).
mlbcal
https://github.com/joeeoj/mlbcal
A CLI wrapper around the MLB Stats API schedule endpoint to generate csv and json reports of a team’s schedule. Publishing this to PyPI helped me better understand Python packaging (pro tip: TestPyPI is great, use it!).
Example usage:
$ mlbcal seattle --csv --nopre > mariners_schedule.csv
lahnman-to-duckdb
https://github.com/joeeoj/lahnman-to-duckdb
A wrapper around the baseballdatabank (aka Lahnman database) to convert the CSV files into a DuckDB database for faster, local analytics of baseball data. I heard about DuckDB in a Data Engineering Podcast episode. It is kind of like the OLAP version of SQLite.