Hi, I'm

Niilo Pääkkönen

M.Sc. Data Science · Software & Data Engineer

Master of Data Science with a background in software engineering and data engineering. I conduct full data science projects end-to-end — from data pipelines and classical ML to LLM-powered systems and production APIs.

About

Master of Data Science based in Finland, specializing in end-to-end AI and analytics solutions. I bridge the gap between raw data and production-ready applications by combining robust software engineering with advanced machine learning. My focus is on delivering reliable and maintainable systems. I leverage modern AI coding assistants to accelerate development while maintaining strict quality control and rigorous code review standards for production safety.

My interest in data science sparked during my computer science bachelor's degree, motivating me to focus heavily on math and statistics to build a strong foundation for data science. Getting my master's degree was a rewarding journey that taught me how to apply deeply theoretical concepts to real, hands on tasks.

Outside of work and personal projects, creativity is a huge part of my life. Whether I am producing music, playing the piano, DJing or playing video games and tabletop RPGs, I find that my creative mindset directly helps me tackle challenges in unique ways.

📍 Finland

Classical Machine Learning

Supervised and unsupervised methods — regression, classification, random forest, SVM, clustering, dimensionality reduction, and model selection with rigorous evaluation.

LLM & Agentic AI

RAG pipelines, multi-step agents, tool-use architectures with LangChain, and comfortable use of agentic coding tools with rigorous code review practices.

Software & Backend Engineering

Production FastAPI services, full stack development, REST APIs, async job queues, and analytical Jupyter notebooks for business insight derivation.

Data Engineering

ETL pipelines, data collection and cleansing, stream processing with Kafka, vector databases, PostgreSQL, and cloud platforms — Fabric, Azure, Snowflake, Databricks.

Analytics & Insight

Exploratory data analysis, statistical storytelling, Jupyter notebooks, data visualisation, and translating findings into clear business recommendations.

Projects

Project Preview

Stock Sentiment

Collects historical and real-time stock news via the Alpaca API and runs FinBERT sentiment analysis on each article. Users select a time frame and get a breakdown of positive, neutral, and negative news for any ticker. News stored in Azure Blob Storage and cleaned in Databricks using a medallion architecture.

Python FinBERT Alpaca API Azure Blob Databricks FastAPI
Project Preview

My FastAPI Template

Production-ready FastAPI template used as the foundation for most of my Python software projects. Includes RAG with pgvector, persistent chat, JWT auth, async background jobs, and pluggable LLM support.

FastAPI LangChain pgvector PostgreSQL Docker
Project Preview

ML Sandbox

Platform for uploading and hosting ML models and datasets, with agentic sentiment analysis on uploaded data. Supports model management, dataset exploration, and AI-driven insight generation.

Python FastAPI LangChain Scikit-learn Pandas Docker
Project Preview

This Portfolio

Portfolio site built with FastAPI and Jinja2 — serves both the HTML frontend and a public REST API. Deployed on Render via Docker with GitHub Actions CI/CD.

FastAPI Jinja2 Docker GitHub Actions Render

Skills

Languages

Python SQL C++ Bash

ML & AI

Scikit-learn TensorFlow LangChain Pandas NumPy Random Forest SVM Regression Deep Learning Target Marketing

Backend & Data

FastAPI PostgreSQL SQLAlchemy Alembic pgvector Kafka Docker

Cloud & Platforms

Azure Microsoft Fabric Snowflake Databricks Git Jira

Contact

Prefer direct contact? Reach me at:

paakkonenniilo@gmail.com GitHub LinkedIn