I'm James, an engineer / data scientist from Chicago. My time on GitHub is mostly spent writing Python, R, and shell scripts on projects for data scientists and data engineers. My time off GitHub is spent with family, at hip hop shows, and watching reality TV.
open source stuff I'm maintainingThe pull requests and none-code contributions below were chosen to showcase the types of software work I've done. This list is not exhaustive.
lightgbm
and xgboost
to scikit-learn
1.6:
conda
packages for legate-boost
, legate-dataframe
, and legate-raft
: rapidsai/legate-boost#115setup.py
with scikit-build-core
for PEP 517/518 compatibility: microsoft/LightGBM#5759dask-lightgbm
into LightGBM and guiding community discussion with Dask, XGBoost maintainers
Webhook
storage to prefect
: PrefectHQ/prefect#3000autoconf
-based builds of LightGBM's R package: microsoft/LightGBM#3188snowflake-connector-python
compatible with pyjwt
1.x and 2.x: snowflakedb/snowflake-connector-python#604{lightgbm}
by ignoring CLI-only objects: microsoft/LightGBM#3566prefect
kubernetes agent: PrefectHQ/prefect#3596httr::RETRY()
in various R packages
{sergeant}
(one example): hrbrmstr/sergeant#42dask-cloudprovider
: dask/dask-cloudprovider#156lightgbm
:
pandas
2.0 wheels: pandas-dev/pandas#51900conda
from "downgrading" Python from CPython to PyPy, while also reducing the risk of a subtle networking error made worse by unpredictability in when Dask garbage collects objects (microsoft/LightGBM#5510)lightgbm
loading failing with GLIBCXX
compatibility errors: microsoft/LightGBM#5106 (comment)jupyter_server
conda-forge feedstock recipe to prevent broken environments: conda-forge/jupyter_server-feedstock#84dask-ml
regression metrics consistent with scikit-learn
: dask/dask-ml#820cuml
: rapidsai/cuml#3388mm_malloc
in {lightgbm}
autoconf-based builds: microsoft/LightGBM#3510{lightgbm}
's docs site: microsoft/LightGBM#3508gendef.exe
for compiling XGBoost and LightGBM R packages with Visual Studio compilers and R 4.0:
{xgboost}
: dmlc/xgboost#5764{lightgbm}
: microsoft/LightGBM#3065fmt
/ spdlog
: rapidsai/build-planning#56manylinux_2_28
: microsoft/LightGBM#5580prefect-saturn
to PyPI when a new release is created: saturncloud/prefect-saturn#7{uptasticsearch}
CI to GitHub Actions: uptake/uptasticsearch#217{lightgbm}
within ASAN and UBSAN sanitizers: microsoft/LightGBM#3439I've given talks on Dask, LightGBM, R, Python packaging, and other random stuff. For a full list and links to videos, see https://github.com/jameslamb/talks#gallery.
My DMs are open if you want to talk about open source, data science careers, Bravo shows, or anything else.
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4