Victor Johnson
Senior AI / LLM Engineer | Production RAG & LLM Systems | Data Science | Azure, Python
Senior AI / LLM Engineer with 7+ years of experience building production LLM and GenAI systems, including RAG, intelligent search, and LLM-powered applications in Azure.
I own LLM systems end-to-end: ingestion, embeddings, retrieval, prompt orchestration, evaluation, deployment, and monitoring, with a focus on reliability, performance, and cost in real products.
Core expertise: LLMs, RAG, vector search, LLM evaluation, Python, Azure OpenAI, Spark, MLOps etc.
England, United Kingdom

Tech Stack
LLMs & Generative AI
- Large Language Models (LLMs)
- Generative AI
- Prompt Engineering
- Retrieval-Augmented Generation (RAG)
- Semantic Search
- Embeddings & Vector Search
- Vector Databases
- LLM Evaluation & Monitoring
- LLMOps
- Chatbot Development
- LlamaIndex
- Azure OpenAI
- Azure AI Foundry
Machine Learning & NLP
- Machine Learning
- Applied Machine Learning
- Natural Language Processing (NLP)
- Scikit-learn
- Artificial Intelligence (AI)
- Predictive Maintenance
Data Science & Analytics
- Data Science
- Python
- SQL
- Transact-SQL (T-SQL)
- Statistical Modeling
- Data Visualization
- Financial Modeling
Data Engineering & Big Data
- Apache Spark
- Azure Databricks
- Apache Kafka
- Kafka Streams
- Apache Airflow
- Azure Data Factory
- Elasticsearch
- Azure SQL
Knowledge Graphs & Search
- Knowledge Graphs
- Knowledge Graph Data Engineering
- Semantic Search & Information Retrieval
MLOps & Production AI
- MLOps
- MLflow
- Model Monitoring
- Model Drift & Bias Detection
- Docker
Cloud & Platforms
- Microsoft Azure
- Azure Machine Learning
- Oracle Cloud Infrastructure (AI Foundations)
Software & APIs
- FastAPI
- REST APIs
- Software Development
- Git
- GitHub
Experience
Data Scientist (AI/LLM Focus)
Feb 2021 — PresentCompany - SYMEUS LTD
England, United Kingdom · Hybrid | Industry - Finance
Operating as a senior individual contributor, leading the design, deployment, and operation of production LLM and RAG systems end-to-end. AI / LLM Systems (Primary): Led development of production-grade LLM and RAG solutions, owning ingestion, embedding pipelines, retrieval, prompt orchestration, evaluation, deployment, and monitoring Built LLM-powered intelligent search, chatbots, and ranking workflows used in internal products Designed and implemented LLM evaluation frameworks (precision/recall, F1, BLEU, ROUGE, relevance & faithfulness metrics) to measure quality and reduce hallucinations Implemented prompt strategies, grounding logic, guardrails, and output validation to improve reliability and safety in live systems Fine-tuned transformer and LLM models on domain-specific data to improve retrieval accuracy and contextual relevance Built scalable data preparation pipelines (cleaning, tokenization, chunking, embeddings) for training and fine-tuning Ran A/B tests and benchmarks across models, embeddings, retrievers, and prompt variants to optimize quality, latency, and cost Machine Learning & Applied Analytics (Secondary): Developed forecasting models for revenue, traffic, and user activity using Python and ML frameworks Built statistical models to support scenario planning and financial analysis Created automated insights pipelines using GA4, GSC, and internal data sources Data Engineering & Platform: Built scalable data pipelines using Apache Spark and Azure Synapse to support near real-time analytics and ML workloads Optimized complex SQL / T-SQL queries (CTEs, window functions, indexing) to improve performance Designed KPI dashboards in Power BI and Streamlit using advanced DAX and Python Improved data quality through governance, validation, and automated checks, reducing reporting turnaround time by ~40%
Data Scientist (ML Focus)
Dec 2018 — Dec 2020Company - Alstom
Bengaluru, India · On-site | Industry - Railways
Applied Machine Learning & Predictive Systems: Developed production machine learning models for predictive maintenance, including Remaining Useful Life (RUL) estimation for critical train components Built time-series and survival analysis models to predict failures, degradation, and maintenance needs across multiple subsystems Analyzed high-volume telemetry data to identify failure patterns, sensor drift, and anomalous behavior in operational environments Designed component-level health indicators and engineered features that improved prediction accuracy and model stability Data Engineering & Model Integration Implemented robust data validation, preprocessing, and feature pipelines to ensure reliability of sensor and operational data Integrated predictive models into operational reporting and decision-support systems, enabling faster and more informed maintenance decisions Supported condition-based maintenance strategies that improved fleet availability and reduced unplanned downtime Analytics Platforms & Visualization Built dashboards in Shiny, Qlik Sense, and Tableau to visualize asset health, predictions, and maintenance KPIs Contributed to the setup of the operations center by delivering KPI-driven visualizations and automated model outputs Applied IEC 62541 standards to improve data acquisition consistency and interoperability across systems
Junior Data Scientist
Dec 2017 — Dec 2018Company - Alstom
Bengaluru, India · On-site | Industry - Railways
Built an early semantic search engine using Elasticsearch and Python (Flask), supporting information retrieval use cases Developed automated ETL pipelines using Apache Airflow Contributed to Spark and Kafka streaming workflows for real-time telemetry data processing Built RPA workflows to automate SAP-based maintenance data handling
Data Science Intern
Sep 2017 — Nov 2017Company - Pi Revolutions
Bengaluru, India · On-site | Industry - Retail Tech
Supported data analysis and automation workflows for NFC-enabled billing kiosks Performed exploratory analysis to support process improvements
Intern
Aug 2016 — Sep 2016Company - Alstom
Bengaluru, India · On-site | Industry - Railways
Automated routine business processes using VBA in Excel, ensuring compliance with internal data standards and improving efficiency, which saved more than 10 hours of reporting work each week
See what my peers and managers say about my work
View verified LinkedIn recommendations↗
Education
Master of Science in Data Science
2021 — 2022 | Grade: DistinctionUniversity of East Anglia
Academic Projects:
- –Depression Detection Using Machine Learning(2021)
Bachelor of Technology in Computer Science
2013 — 2017 | Grade: First ClassUniversity of Calicut
Academic Projects:
- –Weather Forecasting Using Data Mining(2017)
- –Traffic Sign Board Detection and Alerting using Computer Vision(2016)
Certifications
Data Versioning, Lineage, and Quality Monitoring for AI
Check CredentialIntroduction to MLSecOps
Check CredentialSkills: Machine Learning · MLOps · Artificial Intelligence (AI)
Knowledge Graph Data Engineering for Generative AI Use Cases
Check CredentialSkills: Generative AI · Knowledge Graphs · Retrieval-Augmented Generation (RAG) · Artificial Intelligence (AI)
MLOps Essentials: Monitoring Model Drift and Bias
Check CredentialSkills: MLOps · Artificial Intelligence (AI)
MLOps and Data Pipeline Orchestration for AI Systems
Check CredentialSemantic Search and Information Retrieval using GenAI
Check CredentialSkills: Generative AI · Semantic Search · Artificial Intelligence (AI)
Working with Data: Engineering, Integration, and MLOps for AI
Check CredentialSkills: Large Language Model Operations (LLMOps) · Vector Databases · MLOps · Artificial Intelligence (AI)
Alteryx Auto Insights Micro-Credential
Check CredentialCredential ID: 30b212a6-8f6b-4a8d-b44a-d19004b4ab08
Alteryx Designer Core Certified
Check CredentialCredential ID: fa2d1b38-ef83-402b-ab21-e21a91d3e4a3
Alteryx Machine Learning Fundamentals Micro-Credential
Check CredentialCredential ID: af37e6ea-67cb-4b9e-9b8d-ff94afb292a3
Academy Accreditation - Databricks Fundamentals
Check CredentialSkills: Azure Databricks
Academy Accreditation - Generative AI Fundamentals
Check CredentialSkills: Azure Databricks
Oracle Cloud Infrastructure 2025 Certified AI Foundations Associate
Check CredentialCredential ID: 1Z0-1122-25
Skills: Machine Learning · Artificial Intelligence (AI)
AI-Driven Market Analysis: Predict & Profit with ML Models
Check CredentialSkills: Machine Learning
Expert Certificate: Marketing Data Analysis & Data Analytics
Check CredentialSkills: Machine Learning