Victor Johnson
Hey there, I am

Victor Johnson

Senior AI / LLM Engineer | Production RAG & LLM Systems | Data Science | Azure, Python

Senior AI / LLM Engineer with 7+ years of experience building production LLM and GenAI systems, including RAG, intelligent search, and LLM-powered applications in Azure. I own LLM systems end-to-end: ingestion, embeddings, retrieval, prompt orchestration, evaluation, deployment, and monitoring, with a focus on reliability, performance, and cost in real products. Core expertise: LLMs, RAG, vector search, LLM evaluation, Python, Azure OpenAI, Spark, MLOps etc.

England, United Kingdom

Victor Johnson

Tech Stack

LLMs & Generative AI

  • Large Language Models (LLMs)
  • Generative AI
  • Prompt Engineering
  • Retrieval-Augmented Generation (RAG)
  • Semantic Search
  • Embeddings & Vector Search
  • Vector Databases
  • LLM Evaluation & Monitoring
  • LLMOps
  • Chatbot Development
  • LlamaIndex
  • Azure OpenAI
  • Azure AI Foundry

Machine Learning & NLP

  • Machine Learning
  • Applied Machine Learning
  • Natural Language Processing (NLP)
  • Scikit-learn
  • Artificial Intelligence (AI)
  • Predictive Maintenance

Data Science & Analytics

  • Data Science
  • Python
  • SQL
  • Transact-SQL (T-SQL)
  • Statistical Modeling
  • Data Visualization
  • Financial Modeling

Data Engineering & Big Data

  • Apache Spark
  • Azure Databricks
  • Apache Kafka
  • Kafka Streams
  • Apache Airflow
  • Azure Data Factory
  • Elasticsearch
  • Azure SQL

Knowledge Graphs & Search

  • Knowledge Graphs
  • Knowledge Graph Data Engineering
  • Semantic Search & Information Retrieval

MLOps & Production AI

  • MLOps
  • MLflow
  • Model Monitoring
  • Model Drift & Bias Detection
  • Docker

Cloud & Platforms

  • Microsoft Azure
  • Azure Machine Learning
  • Oracle Cloud Infrastructure (AI Foundations)

Software & APIs

  • FastAPI
  • REST APIs
  • Software Development
  • Git
  • GitHub

Experience

Data Scientist (AI/LLM Focus)

Feb 2021Present

Company - SYMEUS LTD

England, United Kingdom · Hybrid | Industry - Finance

Operating as a senior individual contributor, leading the design, deployment, and operation of production LLM and RAG systems end-to-end. AI / LLM Systems (Primary): Led development of production-grade LLM and RAG solutions, owning ingestion, embedding pipelines, retrieval, prompt orchestration, evaluation, deployment, and monitoring Built LLM-powered intelligent search, chatbots, and ranking workflows used in internal products Designed and implemented LLM evaluation frameworks (precision/recall, F1, BLEU, ROUGE, relevance & faithfulness metrics) to measure quality and reduce hallucinations Implemented prompt strategies, grounding logic, guardrails, and output validation to improve reliability and safety in live systems Fine-tuned transformer and LLM models on domain-specific data to improve retrieval accuracy and contextual relevance Built scalable data preparation pipelines (cleaning, tokenization, chunking, embeddings) for training and fine-tuning Ran A/B tests and benchmarks across models, embeddings, retrievers, and prompt variants to optimize quality, latency, and cost Machine Learning & Applied Analytics (Secondary): Developed forecasting models for revenue, traffic, and user activity using Python and ML frameworks Built statistical models to support scenario planning and financial analysis Created automated insights pipelines using GA4, GSC, and internal data sources Data Engineering & Platform: Built scalable data pipelines using Apache Spark and Azure Synapse to support near real-time analytics and ML workloads Optimized complex SQL / T-SQL queries (CTEs, window functions, indexing) to improve performance Designed KPI dashboards in Power BI and Streamlit using advanced DAX and Python Improved data quality through governance, validation, and automated checks, reducing reporting turnaround time by ~40%

Data Scientist (ML Focus)

Dec 2018Dec 2020

Company - Alstom

Bengaluru, India · On-site | Industry - Railways

Applied Machine Learning & Predictive Systems: Developed production machine learning models for predictive maintenance, including Remaining Useful Life (RUL) estimation for critical train components Built time-series and survival analysis models to predict failures, degradation, and maintenance needs across multiple subsystems Analyzed high-volume telemetry data to identify failure patterns, sensor drift, and anomalous behavior in operational environments Designed component-level health indicators and engineered features that improved prediction accuracy and model stability Data Engineering & Model Integration Implemented robust data validation, preprocessing, and feature pipelines to ensure reliability of sensor and operational data Integrated predictive models into operational reporting and decision-support systems, enabling faster and more informed maintenance decisions Supported condition-based maintenance strategies that improved fleet availability and reduced unplanned downtime Analytics Platforms & Visualization Built dashboards in Shiny, Qlik Sense, and Tableau to visualize asset health, predictions, and maintenance KPIs Contributed to the setup of the operations center by delivering KPI-driven visualizations and automated model outputs Applied IEC 62541 standards to improve data acquisition consistency and interoperability across systems

Junior Data Scientist

Dec 2017Dec 2018

Company - Alstom

Bengaluru, India · On-site | Industry - Railways

Built an early semantic search engine using Elasticsearch and Python (Flask), supporting information retrieval use cases Developed automated ETL pipelines using Apache Airflow Contributed to Spark and Kafka streaming workflows for real-time telemetry data processing Built RPA workflows to automate SAP-based maintenance data handling

Data Science Intern

Sep 2017Nov 2017

Company - Pi Revolutions

Bengaluru, India · On-site | Industry - Retail Tech

Supported data analysis and automation workflows for NFC-enabled billing kiosks Performed exploratory analysis to support process improvements

Intern

Aug 2016Sep 2016

Company - Alstom

Bengaluru, India · On-site | Industry - Railways

Automated routine business processes using VBA in Excel, ensuring compliance with internal data standards and improving efficiency, which saved more than 10 hours of reporting work each week

See what my peers and managers say about my work

View verified LinkedIn recommendations

LinkedIn

Education

Master of Science in Data Science

20212022 | Grade: Distinction

University of East Anglia

Academic Projects:

  • Depression Detection Using Machine Learning(2021)

Bachelor of Technology in Computer Science

20132017 | Grade: First Class

University of Calicut

Academic Projects:

  • Weather Forecasting Using Data Mining(2017)
  • Traffic Sign Board Detection and Alerting using Computer Vision(2016)

Certifications

Data Versioning, Lineage, and Quality Monitoring for AI

Check Credential
LinkedInIssued Dec 2025

Introduction to MLSecOps

Check Credential
LinkedInIssued Dec 2025

Skills: Machine Learning · MLOps · Artificial Intelligence (AI)

Knowledge Graph Data Engineering for Generative AI Use Cases

Check Credential
LinkedInIssued Dec 2025

Skills: Generative AI · Knowledge Graphs · Retrieval-Augmented Generation (RAG) · Artificial Intelligence (AI)

MLOps Essentials: Monitoring Model Drift and Bias

Check Credential
LinkedInIssued Dec 2025

Skills: MLOps · Artificial Intelligence (AI)

MLOps and Data Pipeline Orchestration for AI Systems

Check Credential
LinkedInIssued Dec 2025

Semantic Search and Information Retrieval using GenAI

Check Credential
LinkedInIssued Dec 2025

Skills: Generative AI · Semantic Search · Artificial Intelligence (AI)

Working with Data: Engineering, Integration, and MLOps for AI

Check Credential
LinkedInIssued Dec 2025

Skills: Large Language Model Operations (LLMOps) · Vector Databases · MLOps · Artificial Intelligence (AI)

AI Agents Fundamentals

Check Credential
Hugging FaceIssued Oct 2025

Credential ID: victor-johnson

Alteryx Auto Insights Micro-Credential

Check Credential
AlteryxIssued Oct 2025 · Expires Oct 2027

Credential ID: 30b212a6-8f6b-4a8d-b44a-d19004b4ab08

Alteryx Designer Core Certified

Check Credential
AlteryxIssued Oct 2025 · Expires Oct 2027

Credential ID: fa2d1b38-ef83-402b-ab21-e21a91d3e4a3

Alteryx Machine Learning Fundamentals Micro-Credential

Check Credential
AlteryxIssued Oct 2025 · Expires Oct 2027

Credential ID: af37e6ea-67cb-4b9e-9b8d-ff94afb292a3

n8n Course Level 1

Check Credential
n8nIssued Oct 2025

Credential ID: 4916639a1c6dddf0372d1f9fcf29623c

n8n Course Level 2

Check Credential
n8nIssued Oct 2025

Credential ID: 4916639a1c6dddf0372d1f9fcf29623c

Academy Accreditation - Databricks Fundamentals

Check Credential
DatabricksIssued Sep 2025

Skills: Azure Databricks

Academy Accreditation - Generative AI Fundamentals

Check Credential
DatabricksIssued Sep 2025

Skills: Azure Databricks

Oracle Cloud Infrastructure 2025 Certified AI Foundations Associate

Check Credential
OracleIssued Sep 2025 · Expires Sep 2027

Credential ID: 1Z0-1122-25

Skills: Machine Learning · Artificial Intelligence (AI)

AI-Driven Market Analysis: Predict & Profit with ML Models

Check Credential
UdemyIssued Oct 2025

Skills: Machine Learning

Expert Certificate: Marketing Data Analysis & Data Analytics

Check Credential
UdemyIssued Oct 2025

Skills: Machine Learning

NLP in Python: Probability Models, Statistics, Text Analysis

Check Credential
UdemyIssued Oct 2025

NotebookLM Mastery: Organize, Analyze, and Optimize with AI

Check Credential
UdemyIssued Oct 2025