Data scientist with 20+ years of experience modelling complex, noisy systems — originally in astrophysics, now applied to real-world data problems.
Published machine-learning researcher with 20+ years of quantitative modelling experience, now applying those skills to commercial problems.
My work focuses on machine learning, statistical inference and time series analysis, with an emphasis on extracting signal from difficult datasets and supporting decision-making under uncertainty.
Machine Learning
- Classification
- Regression
- Neural Networks
Statistics
- A/B Testing
- Confidence Intervals
- Hypothesis Testing
Forecasting
- ARIMA
- Holt-Winters
- Prophet
Programming
- Python
- SQL
- Git
- C
- IDL
- Unix shell
-
Developed supervised models to identify rare events in highly imbalanced data, achieving strong precision while maintaining a low alert rate. Focused on threshold optimisation, uncertainty, and real-world trade-offs between detection and operational cost.
-
Built reusable Python tools for comparing groups using confidence intervals and hypothesis testing. Designed to support practical decision-making in experimentation workflows.
-
Created an interactive framework for comparing forecasting models (ARIMA, Holt-Winters, Prophet) with built-in backtesting and error analysis to evaluate real-world performance.
-
Implemented neural network models for continuous parameter estimation from high-dimensional data, including full pipelines for preprocessing, training, validation, and uncertainty assessment.
Python (pandas, NumPy, scikit-learn, TensorFlow), statistical modelling, machine learning, time series forecasting, hypothesis testing, data visualisation.
| Technical Skills | Soft Skills | Python | Other languages | Documentation |
|---|---|---|---|---|
| Data Analysis | Team Leadership | dash | C | HTML |
| Machine Learning | Project Management | jupyter | IDL | Latex |
| Neural Networks | Teaching & Supervision | matplotlib | PHP | Markdown |
| Data Visualisation | Science Communication | numpy | SQL | Plotly dashboards |
| Statistical Analysis | Public Speaking | pandas | Shell scripting | Tableau |
| Scientific Research | TV and Radio | scikit-learn | Pgplot | Office |
| Simulations | International Collaboration | tensorflow | Gnuplot |

