GaiskaSalomon GaiskaSalomon

👋 Hi, I'm Gaiska Salomón

🎓 Ph.D. Candidate in Statistics & Data Science — Machine Learning · Time Series · LLMs

🚀 About Me

Data scientist and researcher with a strong statistical foundation (probability, Bayesian methods, time-series). I build the full lifecycle — from data pipelines and feature engineering to model training, rigorous validation, and deployment — across machine learning, deep learning, and LLMs. Recent work spans commodity-return forecasting, urban-mobility analytics, and domain-specific Spanish language models.

🔭 Currently: applying ML/DL to real problems and shipping reproducible, documented projects.
🌱 Comfortable from classic ML & statistics to LLM fine-tuning + RAG.
🗣️ Spanish (native) · English (intermediate, conversational).

🔬 Focus Areas

Machine Learning & Deep Learning — predictive modeling, gradient boosting, neural nets.
Time Series & Forecasting — walk-forward validation, backtesting, high-frequency data.
Statistical Modeling — inference, Bayesian methods, uncertainty quantification.
LLMs / NLP — fine-tuning (LoRA/QLoRA), retrieval-augmented generation (RAG).
Applied Data Science — data pipelines, dashboards, and clear communication of results.

🛠️ Tech Stack

Languages

ML / Deep Learning

LLMs / NLP

Data & Tooling

📌 Featured Projects

📈 climate-commodity-alpha-lab

Quantitative research: do weather & climate-risk features improve commodity return forecasts? Walk-forward validation, XGBoost/LightGBM, Bayesian methods, cost-aware backtesting (Sharpe, IC, drawdown). Python · XGBoost · LightGBM · PyMC · time-series · backtesting

🚲 CDMX Mobility Pulse

Reproducible pipeline + interactive dashboard for Mexico City mobility (GTFS, ECOBICI GBFS, C5). Ingestion, data-quality reports, KPIs, and 7-day demand forecasting. Python · Streamlit · data-pipeline · XGBoost / LightGBM / CatBoost

🤖 AgroLLM-ES

Domain-specific Spanish LLM pipeline: dataset cleaning/deduplication, QLoRA fine-tuning (HuggingFace + TRL + PEFT), and RAG on PostgreSQL + pgvector with an evaluation suite. PyTorch · Hugging Face · QLoRA · RAG · pgvector

📊 GitHub Stats

“Transforming data into actionable insights is not just my profession, it's my passion.”

Provide feedback

Saved searches

Use saved searches to filter your results more quickly