Bridging data, science & strategy ๐ Machine Learning ๐ ๏ธ Tool Development ๐ฆ R Software ๐งญ Leadership ๐งฌ Life Sciences Domain Expert โจ Director of Data Science @ Cercle.ai
๐ฌ Domain expertise: Proteomics, biomarker discovery, diagnostics, life sciences, predictive modeling
๐ Technical tools: R, machine learning, statistics, Python, experimental design, reproducible research
๐ช Strengths: Translating complexity, cross-functional collaboration, storytelling with data
About Me"Making predictions is easy ... making accurate ones is much more difficult." โฏ Meโฏ
I love to solve problems.
Often the problem can be understanding a complex biological process, but it can also be as simple as fixing something that's broken (e.g. a door that jams, a bicycle, or even machine learning software). In particular, I like to apply my data science skills to better understand, or even solve, the problems we face.
Over the past 12+ years I have combined my statistical knowledge and Open-Source Software tools to solve complex problems in the Life Sciences proteomics (high dimensional) space. In so doing, I have created a comprehensive R-based machine learning analysis ecosystem that standardizes and enables biomarker discovery and predictive model development.
Sometimes the problem is inconsistency across teams or analysts ... thus I promote adherence of "tidy" data principles and am a strong proponent reproducible research and use of bioinformatics pipelines.
Other times the problem can be sharing results across the organization ... thus developing an Application Program Interface (API) infrastructure that enables anyone to access model results with ease.
With my teaching background, I find it important to mentor junior team members while simultaneously leading more senior members. This collaborative spirit is essential to building and effective team that delivers to stakeholders, fosters a sense of accomplishment, and drives revenue generation.
I am always open to discuss possible roles ๐ญ and whether my skill set can solve problems in your space!
Machine Learning ๐ Statistics ๐ Open-Source ๐ป Software Tools ๐ง Random Forest Logistic regression R Linux๐ง, MacOS ๐ Naive Bayes Linear regression C++ Git, GitHub Lasso/ridge regression GLMMs Python ๐ AWS k-Nearest neighbour Mixed-effects models LaTeX BASH, GNU PCA Survival analysis CI/CD BitBucket Ensemble methods Multivariate statistics Docker ๐ Slack Maximum Likelihood ANOVA KubernetesR
... I'll talk your ๐ off!R
software libraries (๐ฆ) that implement statistical and machine learning techniques in biomarker discovery. Some of my popular published ๐ฆ are:
RStudio/Posit
IDE.RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4