Pyae Sone Kyaw

FOUNDER · FULL-STACK & AI ENGINEER · PARIS
AI Engineering · Full-Stack Systems · Backend & Cloud Data
OPEN TO AI / FULL-STACK / BACKEND ROLES · FR / DE / UK
01 — ABOUT

AI Engineer · Full-Stack · Backend Systems

5+ Yrs Engineering · 12+ Projects Shipped · 2 Master's Degrees · 3 Countries Lived

AI Engineer · Full-Stack · Backend Systems — building production-grade software at the intersection of telecom, healthcare, and finance, across Asia and Europe.

Currently the founder of Ekkhara, a self-funded software & AI studio building EdTech for Myanmar. Its most recent product is SpeakProof, a Telegram-native TOEFL practice bot that works without a VPN. Before this, I was a Full-Stack AI Engineer at Siloett.AI (Station F, Paris), architecting end-to-end Generative AI systems on Azure with FastAPI and React/TypeScript.

Beyond AI, my engineering portfolio spans backend systems — Java 21, Spring Boot 3.5, Kafka, microservices — and cloud data engineering on AWS, Snowflake, Databricks, dbt, and Airflow. Telecom domain depth includes CDR pipelines, SMPP gateways, and Diameter Credit-Control.

Dual Master's degrees from Telecom SudParis (Paris) and AIT (Bangkok), plus 5+ years shipping production software across healthcare, RegTech, telecom, and smart-city domains.

Currently exploring AI Engineer, Full-Stack, and Backend Engineer roles across France, Germany, and the UK. If you're hiring or know of a fit — let's talk.

TECH STACK
PyTorch · TensorFlow · scikit-learn · Hugging Face · LangChain · OpenAI · FAISS · Python · Java · TypeScript · JavaScript · SQL · React · Next.js · Tailwind CSS · Spring Boot · FastAPI · Node.js · Django · ASP.NET Core · Kafka · RabbitMQ · Azure · AWS · GCP · Vercel · PostgreSQL · MongoDB · Redis · Supabase · Neo4j · Pinecone · ChromaDB · Docker · Kubernetes · GitHub Actions · GitLab CI · Terraform · Snowflake · Databricks · dbt · Airflow · Apache Spark · Pandas · NumPy · Jupyter
02 — EXPERIENCE

Where I've Built & Learned

Founder & Full-Stack / AI Engineer

Ekkhara · Founder · EdTech for Myanmar
May 2026 — Present

Founded Ekkhara, a self-funded software & AI studio building EdTech products for Myanmar — taking each from idea to live product and owning architecture, backend, AI, and front-end end-to-end.

Built SpeakProof, a TOEFL speaking & English-practice bot that runs entirely inside Telegram — letting Myanmar learners train for the TOEFL computer-based test from an app they already use daily, with no VPN needed despite the country's internet restrictions.

Engineer the full stack behind each product — Python / FastAPI services, LLM-driven feedback and scoring, and conversational UX — shipping AI tools that reach users where access and infrastructure are constrained.

Full-Stack AI Engineer

Siloett.AI · AI Safety & Compliance · Station F · Paris
Jun 2025 — May 2026

Architected and built an end-to-end Generative AI platform from zero: LLM orchestration, RAG pipelines, and a production React/TypeScript frontend with a Python (FastAPI) and Azure Functions serverless backend

Designed and implemented responsible-AI validation layers including content-safety filters, bias-detection checks, and compliance guardrails ensuring all AI-generated outputs meet regulatory and ethical standards

Built AI safety and IP compliance systems including audit logging, provenance tracking, and output attribution pipelines — ensuring traceable and accountable AI output lifecycles

Developed prompt-engineering and LLM evaluation frameworks using Azure OpenAI GPT-4o and LangChain, with systematic fine-tuning to optimise domain accuracy across compliance use cases

Data Science / Cloud Data Engineer

Floware SAS · Station F · Paris
Jul 2024 — Dec 2025

Designed a cloud-based batch data processing pipeline using Microsoft Azure Batch and Docker for urban mobility analytics

Automated computer-vision and Bluetooth sensor workflows, enabling scalable, real-time smart-city insights

Delivered production analytics dashboards adopted by city government and transport stakeholders

Research & Back-End Engineer

DICE Lab, Telecom SudParis · Computer Vision · Backend AI
Aug 2023 — Jul 2024

Engineered back-end services and data pipelines for a research lab delivering applied-AI capabilities to partner organisations — productionising computer-vision and NLP research behind Python APIs

Built a complete English–Myanmar machine translation system on a 10K gold-labelled WikiHow corpus — owning dataset construction, model training, evaluation, and deployment behind a public API

Developed novel research prototypes in computer vision and multilingual NLP, plus the back-end infrastructure to serve them — turning lab research into reproducible, deployable tools

Research & Back-End Engineer

AIT BrainLab · Applied-AI Lab · Backend
Jan 2023 — Aug 2023

Built an NLP paraphrasing tool end-to-end at a research lab providing applied-AI services to companies — from model development to back-end API

Engineered the data layer — cleaning and structuring training datasets — and optimised training workflows, improving model accuracy by 15%

Owned back-end operations for the lab's AI tooling, strengthening data-management and R&D engineering practice

Software Engineer (Web)

FAO, UN Food & Agriculture Org · Full-Stack Web · UN Agency
Jan 2021 — Dec 2022

Built full-stack web applications for a United Nations agency (FAO) — internal data dashboards and public-facing portals delivering agricultural and food-security programme data to staff and field teams

Engineered field data-collection tools and the back-end APIs behind them, letting field officers capture, validate, and sync programme data into centralised reporting systems

Owned features end-to-end across the stack — React/Next.js + TypeScript front-ends with Python and PHP/CMS back-ends — from data model and REST API to deployed UI

03 — PROJECTS

Things I've Built

FEATURED
🗣️

SpeakProof

Telegram-native TOEFL speaking & English-practice bot for Myanmar learners — drill toward the TOEFL computer-based test inside an app people already use daily, with zero VPN needed despite nationwide internet restrictions. LLM-powered speaking feedback and scoring on a Python / FastAPI backend. An Ekkhara EdTech product.

Python · Telegram Bot · FastAPI · LLM · TOEFL
LIVE ON TELEGRAM
FEATURED

GridFlex

Real-time European grid lakehouse on AWS — probabilistic load forecasting (DeepAR / quantile loss) and stochastic optimisation for battery flexibility decisions. Medallion architecture on S3 + Iceberg, MSK Kafka streaming, dbt models, Airflow orchestration, MLflow tracking. Cloud Data Engineering portfolio piece.

AWS · Iceberg · Kafka · dbt · Airflow · MLflow
OPEN-SOURCE ON GITHUB
FEATURED
📞

CDR Pipeline

Event-driven Call Detail Record ingestion, rating, and reconciliation pipeline simulating an MVNO billing back-end. Idempotent Kafka consumers, Spring Boot 3.5 microservices, MySQL for rated CDRs, MongoDB for raw events, Docker Compose for local infra. Built around real telecom domain models (3GPP TS 32.298).

Java 21 · Spring Boot 3.5 · Kafka · MySQL · MongoDB · Docker
OPEN-SOURCE ON GITHUB
FEATURED
🌱

CSRD Lake

End-to-end CSRD/ESRS sustainability data pipeline — Claude/Mistral GenAI extraction from sustainability reports with page-level audit lineage, Snowflake warehouse (validated) + DuckDB local dev, dbt transformations, Airflow orchestration. Built for EU corporate sustainability reporting compliance.

Snowflake · DuckDB · dbt · Airflow · Claude · Mistral
OPEN-SOURCE ON GITHUB
FEATURED
🩺

VitaLens

AI-powered blood test interpretation for personalised supplement guidance — competing at Haleon VivaTech 2026. OCR ingestion of French lab reports, validated biomarker classification, longitudinal tracking, and supplement recommendations grounded in clinical evidence. Real-data moat through partner lab integrations.

Next.js · FastAPI · PostgreSQL · OpenAI · OCR
LIVE ON VERCEL
FEATURED
🔬

AgentProbe

From-scratch ReAct Agent Observatory — observe, debug, and benchmark LLM agents with a built-in 8-type failure taxonomy (hallucinated tools, malformed actions, context overflow, goal drift, …) and multi-provider eval harness (Groq, OpenAI, Anthropic, Google, Ollama). Composite scoring (answer + tools + efficiency + reliability), Clean Architecture FastAPI + PostgreSQL backend, Next.js 16 frontend with real-time SSE streaming. 50+ benchmark cases, 81 tests.

Python · FastAPI · Next.js 16 · PostgreSQL · ReAct · SSE
OPEN-SOURCE ON GITHUB
FEATURED
💉

VaxEvidence

Real-World Evidence platform for vaccine research: a full-stack application for ingesting, analysing, and visualising vaccine safety and efficacy data from real-world sources at scale. Built at Siloett.AI for healthcare and life-sciences buyers.

Next.js · React · TypeScript · Supabase
LIVE ON VERCEL
💳

Diameter CC

Diameter Credit-Control Server (Gy / RFC 4006 / App-Id 4) for real-time prepaid mobile billing. Idempotent partial debits, jdiameter 1.7.x stack, designed against TRANSATEL/MVNO production patterns.

Java 21 · Spring Boot 3.5 · Diameter · jdiameter
📨

SMPP Gateway

Java SMPP v3.4 gateway bridging inbound SMS (submit_sm) to RabbitMQ for downstream processing. Spring Boot 3.5 + jsmpp 3.0, dockerised, designed for MVNO and telecom messaging platforms.

Java 21 · Spring Boot 3.5 · SMPP · RabbitMQ · Docker
🚦

Mobility Pulse

Real-time urban mobility analytics on TimescaleDB + PostGIS + Uber H3. Spring Boot ingestion, Kafka stream processing, Server-Sent Events for live dashboards. Smart-city back-end blueprint.

Spring Boot · Kafka · TimescaleDB · PostGIS · H3
🏦

BCBS 239 Lakehouse

Reference implementation of the BCBS 239 risk-data-aggregation lakehouse pattern on Databricks + Delta Lake + Unity Catalog + dbt-databricks. MIT-licensed, synthetic data only — banking risk-data engineering portfolio piece.

Databricks · Delta Lake · Unity Catalog · dbt

SafeGen.dev

Serverless middleware enforcing responsible-AI compliance on LLM applications — PII detection, bias screening, hate-speech filtering, and RAG-powered policy enforcement on top of Azure OpenAI.

React · Azure Functions · FAISS · GPT-4o

GreenLens

Cloud Carbon Intelligence platform estimating CO2e emissions from Azure infrastructure with AI-powered reduction recommendations. Clean Architecture, Azure AI Search semantic factor lookup, 88 automated tests. Built for EU CSRD Scope 3 compliance.

ASP.NET Core · Angular · Azure AI Search · Azure OpenAI
04 — SKILLS

Tools of the Trade

AI · Python · Java · TypeScript · PyTorch · Spring Boot · Next.js · LangChain · Azure · AWS · Snowflake
AI / ML
PyTorch · TensorFlow · scikit-learn · Hugging Face · LangChain · OpenAI · FAISS
LANGUAGE
Python · Java · TypeScript · JavaScript · SQL
FRONTEND
React · Next.js · Tailwind CSS
BACKEND
Spring Boot · FastAPI · Node.js · Django · ASP.NET Core · Kafka · RabbitMQ
CLOUD
Azure · AWS · GCP · Vercel
DATABASE
PostgreSQL · MongoDB · Redis · Supabase · Neo4j · Pinecone · ChromaDB
DEVOPS
Docker · Kubernetes · GitHub Actions · GitLab CI · Terraform
DATA
Snowflake · Databricks · dbt · Airflow · Apache Spark · Pandas · NumPy · Jupyter
ALSO FLUENT IN

Iceberg · Delta Lake · Unity Catalog · TimescaleDB · PostGIS · H3 · jdiameter · jsmpp · LangGraph · MLflow · Weights & Biases · DVC · Ray · Celery · GraphQL · Prisma · SQLAlchemy · Playwright · Vitest · pytest · Testcontainers

05 — EDUCATION

Asia to Europe.
The Journey Shapes the Engineer

🇲🇲 2016–2020 · Yangon · Origins
🇹🇭 2022–2024 · Bangkok · Asian Institute of Technology
🇫🇷 2023–2024 · Paris · Telecom SudParis
🏙 2025–Now · Station F · Siloett.AI

🇫🇷 2023–2024
MSc Data Science & Network Intelligence
Telecom SudParis · Paris, France
GPA 15.15/20 · QS Rank #46 Worldwide

🇹🇭 2022–2024
MSc Data Science & Artificial Intelligence
Asian Institute of Technology · Bangkok, Thailand
GPA 3.17/4.0 · Dual Degree Program

🇲🇲 2016–2020
BA Social Science Studies
Myanmar Institute of Theology · Yangon, Myanmar
GPA 3.67/4.0 · Graduated Top of Class

🇲🇲 2016–2020*
BA English Literature & Linguistics
University of Yangon · Yangon, Myanmar
GPA 4.09/5.0 · Myanmar's Oldest University
*Interrupted due to COVID-19 & Military Coup

07 — CONTACT

Let's Build Something.

Currently exploring Backend Engineer, Cloud Data Engineer, and AI Engineer roles across France, Germany, and the UK — plus founding team opportunities. If you're hiring or know of a fit, I'd love to hear from you.

BASED IN PARIS · OPEN TO REMOTE