~/portfolio/init

Zhixiang Feng

Machine Learning Researcher and Software Engineer focused on retrieval systems

I work on neural reranking, retrieval evaluation, and practical software that makes model behaviour easier to inspect and improve.

RAG

current focus

First

BSc honours

MSc ACSE

current degree

profile_probe.sh

$ whoami

Machine Learning Researcher and Software Engineer focused on retrieval systems

$ inspect --availability

Machine Learning Researcher at Revax, improving neural reranking for UK tax RAG. Open to new AI, ML, and software engineering positions.

$ load_focus --tabs ai software

AI path

python eval_reranker.py --slice hard_tax_queries

Software path

npm test && ship --target browser cloud

$ status

ready for review, refactor, deployment

role router

Browse the portfolio by signal

python eval_reranker.py --slice hard_tax_queries

My current work is neural reranking for UK tax RAG. My research background covers adversarial attacks on ranking systems, BERT-based question answering, and synthetic clinical label generation.

RAG, neural reranking, information retrieval, BERT, synthetic data, evaluation

npm test && ship --target browser cloud

I have built Chrome extensions, Django systems, Angular and Ionic mobile flows, and SaaS tooling. I care about clear permissions, local data safety, CI, and interfaces that stay usable in real-world use.

Rust, JavaScript, Django, Chrome extensions, CI, system design

selected work

Projects that explain how I think

Selected work from retrieval research, applied ML, browser tooling, and full-stack projects. The focus is on what was built, what was evaluated, and where the engineering judgement mattered.

2026

UK Tax RAG Neural Reranker

AI Software

Current ML research work at Revax, focused on improving the neural reranker behind a UK tax retrieval-augmented generation workflow.

Python, RAG, neural reranking, retrieval evaluation, tax domain QA
  • Improving retrieved evidence quality before answer generation
  • Analysing hard tax queries and reranker failure cases
  • Turning evaluation findings into practical reranker changes

2025

Synthetic Epilepsy Labels Generation

AI

AI for healthcare research on generating synthetic multi-label epilepsy datasets from a small set of clinical letters.

Python, synthetic data, clinical NLP, multi-label classification, evaluation
  • Investigating synthetic labels for scarce real-world medical text
  • Designing checks for label diversity, accuracy, and reliability
  • Targeting more than 15,000 synthetic cases from fewer than 100 real samples

2024 to 2025

Adversarial Attacks on Ranking Systems

AI

Research on how adversarial attacks affect neural and non-neural retrieval pipelines in search systems.

Python, neural ranking, information retrieval, adversarial attacks, ranking evaluation
  • Implemented recent adversarial attack algorithms targeting ranking components
  • Simulated search pipelines that combine neural and non-neural retrieval
  • Analysed how attacks changed document order, relevance scores, and search behaviour

2023 to 2024

Hawk Page Indexer

Software AI

Chrome extension built in a University of Glasgow team project with Amazon.com, Inc. as the client, combining a page indexer, a markdown notebook, and a to-do list.

Project page: Hawk - Page Indexer on the Chrome Web Store

JavaScript, Node.js, Chrome extension, BM25F, MiniSearch, GitLab CI
  • Implemented local page indexing and search over selected web content
  • Used BM25F with MiniSearch fallback for ranked document retrieval
  • Supported JSON import and export for indexed page backups
  • Published as Hawk - Page Indexer on the Chrome Web Store
  • Presented a final product recognised by software engineers at Amazon Edinburgh
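
As a rough illustration of the BM25F ranking mentioned above, here is a simplified sketch. It is written in Python rather than the extension's JavaScript, it is not the project's code, and the field names, weights, and sample pages are hypothetical. It assumes the common simplified BM25F form, where per-field term frequencies are length-normalised, weighted, and summed before saturation.

```python
import math


def bm25f_score(query_terms, doc, corpus, field_weights, k1=1.2, b=0.75):
    """Score one document; doc and corpus entries map field name -> token list."""
    n_docs = len(corpus)
    # Average length of each field across the corpus, used for length normalisation.
    avg_len = {
        f: sum(len(d.get(f, [])) for d in corpus) / n_docs for f in field_weights
    }
    score = 0.0
    for term in query_terms:
        # Document frequency: documents containing the term in any weighted field.
        df = sum(any(term in d.get(f, []) for f in field_weights) for d in corpus)
        if df == 0:
            continue
        idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
        # Combine per-field term frequencies into one weighted pseudo-frequency.
        pseudo_tf = 0.0
        for f, weight in field_weights.items():
            tokens = doc.get(f, [])
            norm = 1 - b + b * (len(tokens) / avg_len[f]) if avg_len[f] else 1.0
            pseudo_tf += weight * tokens.count(term) / norm
        # Saturate the combined frequency so repeated terms give diminishing returns.
        score += idf * pseudo_tf / (k1 + pseudo_tf)
    return score


# Hypothetical indexed pages with a title field weighted above the body.
pages = [
    {"title": ["rust", "ownership"], "body": ["ownership", "rules", "in", "rust"]},
    {"title": ["python", "typing"], "body": ["type", "hints", "in", "python"]},
]
weights = {"title": 2.0, "body": 1.0}
ranked = sorted(
    pages, key=lambda d: bm25f_score(["rust"], d, pages, weights), reverse=True
)
print(ranked[0]["title"])
```

Weighting the title above the body is one plausible choice for page search; the actual weights and fallback behaviour in the extension may differ.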

2024

AI Bot in a Tank Game

AI Software

First-place hackathon entry for MS Tanks, using a TCP API, an epsilon-greedy strategy, and a subsumption architecture.

Python, reinforcement learning, Q-table, TCP API, agent behaviour
  • Built an autonomous tank agent for the Do You Have the GUTS Hackathon 2024
  • Balanced exploration and exploitation with an epsilon-greedy policy
  • Prioritised survival and resource search through subsumption-based behaviours
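
A minimal sketch of epsilon-greedy action selection, for readers unfamiliar with the idea. This is not the hackathon agent's code; the function name, the three-action value list, and the seed are assumptions made for illustration.

```python
import random


def epsilon_greedy(q_values, epsilon, rng):
    """Return an action index: explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        # Explore: pick any action uniformly at random.
        return rng.randrange(len(q_values))
    # Exploit: pick the first action with the highest estimated value.
    return q_values.index(max(q_values))


rng = random.Random(0)
q_values = [0.1, 0.7, 0.2]  # hypothetical value estimates for three actions

greedy_action = epsilon_greedy(q_values, epsilon=0.0, rng=rng)  # always exploits
explored = [epsilon_greedy(q_values, epsilon=1.0, rng=rng) for _ in range(20)]
print(greedy_action, explored)
```

With `epsilon=0.0` the agent always picks the highest-valued action; raising epsilon trades some of that reward for information about the other actions.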

2024

NLP Question Answering System

AI

Question answering project using WikiQA, set similarity methods, term frequency vectors, and BERT-based context vectors.

Python, BERT, WikiQA, cosine similarity, classification metrics
  • Compared overlap coefficient, Sørensen-Dice, and Jaccard similarity
  • Evaluated BERT-based answer selection with accuracy, precision, recall, and F1
  • Analysed model stability across different question and answer scenarios
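
The three set similarity measures named above can be sketched as follows. This is an illustrative version using the standard definitions, not the project's code; the whitespace tokenisation and the toy question and candidate answer are assumptions.

```python
def jaccard(a, b):
    """|A ∩ B| / |A ∪ B|"""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def sorensen_dice(a, b):
    """2 |A ∩ B| / (|A| + |B|)"""
    a, b = set(a), set(b)
    total = len(a) + len(b)
    return 2 * len(a & b) / total if total else 0.0


def overlap_coefficient(a, b):
    """|A ∩ B| / min(|A|, |B|)"""
    a, b = set(a), set(b)
    smaller = min(len(a), len(b))
    return len(a & b) / smaller if smaller else 0.0


# Toy example: token sets share 3 terms, with sizes 4 and 6 and a union of 7.
question = "how are glaciers formed".split()
candidate = "glaciers are formed by compacted snow".split()

print(jaccard(question, candidate))              # 3 / 7
print(sorensen_dice(question, candidate))        # 6 / 10
print(overlap_coefficient(question, candidate))  # 3 / 4
```

The overlap coefficient normalises by the smaller set, so it favours short questions fully covered by a long answer, while Jaccard penalises the extra answer tokens.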

2023

AuTogether Event Registration App

Software

Mobile event registration app for a non-profit organisation supporting children with autism and their families.

Angular, Ionic, Django, Python, JavaScript
  • Built event sign-up flows for parents and children
  • Added organiser tools for venue capacity and booking management
  • Connected mobile UI work with backend functionality

2023

Best Recipes Web App

Software

Recipe sharing web app where users could publish recipe blogs, rate posts, save favourites, and explore trends.

Python, SQLite, JavaScript, HTML, CSS
  • Worked on requirements analysis, UI design, technology selection, testing, and deployment
  • Built posting, editing, rating, and favourites functionality
  • Used the project to practise full-stack delivery from schema to interface

capability map

Tools are useful, but judgement is the real skill

Machine learning and IR

PyTorch, scikit-learn, Hugging Face, Recommender Systems, RAG, Neural Reranking

Programming

Python, Rust, C, C++, JavaScript, TypeScript, SQL, Java, Haskell, Erlang, Go

Backend and Apps

Django, Node.js, Tauri, Axum, Chrome Extensions, Angular, Ionic, React Native

Engineering Practice

Git, CI Workflows, CI/CD, DevOps, Testing, Code Review, System Diagrams, Parallel Programming, OpenMP, MPI, Coaching
  • Evaluate retrieval failures before tuning prompts
  • Keep model changes tied to examples and metrics
  • Use local, data-safe defaults when browser tooling touches user content
  • Write software another engineer can debug under deadline

timeline

Experience shaped around proof

Machine Learning Researcher

Revax

05/2026 to present

  • Working on UK tax RAG retrieval quality, with the current focus on improving the neural reranker
  • Analysing hard queries and reranker failures so model changes are tied to search behaviour
  • Connecting evaluation results to practical changes in tax-focused question answering
RAG, neural reranking, evaluation, Python

MSc Applied Computational Science and Engineering

Imperial College London

09/2025 to 09/2026 expected

  • Studying computational methods for applied science and engineering problems
  • Building on a computing science background with stronger modelling and numerical foundations
  • Using the programme to sharpen ML, simulation, and software engineering judgement
MSc, computational science, engineering

BSc Computing Science

University of Glasgow

09/2021 to 07/2025

  • Graduated with First Class Honours and a specialism in Data Management
  • Received the Undergraduate Excellence Scholarship from 2021 to 2025
  • Built a foundation across algorithms, data, software engineering, and applied AI projects
First Class Honours, data management, scholarship

Assistant Demonstrator

School of Computing Science, University of Glasgow

09/2024 to 06/2025

  • Delivered tutorials and hands-on labs for Computer Programming, Further Programming, and Algorithmic Foundations
  • Helped students work through core programming concepts and practical exercises
  • Marked assessed exercises and gave feedback that students could act on
coaching, programming, algorithms

Full Stack Developer Intern

MESYI Technology Ltd.

06/2023 to 08/2023

  • Worked on requirements analysis, technology selection, database design, and system architecture
  • Developed SaaS staff and business management features with Python, Django, and JavaScript
  • Produced ER diagrams, wireframes, and architecture diagrams before supporting testing, deployment, and database connection
Django, JavaScript, SaaS, database design

secure channel

Send a focused message

Tell me what you are building, what is broken, or what role you are hiring for. A specific message beats a polished one.