Professional Summary

Results-driven Machine Learning Engineer and Data Scientist with extensive experience in AI, ML, and data analytics.

Skilled in building scalable ML models, developing LLM-based solutions, and deploying AI-driven risk mitigation strategies.

Proven ability to optimize fraud detection, behavioral analytics, and real-time AI solutions for large-scale platforms.

Adept at Python, PyTorch, TensorFlow, SQL, AWS, and big data processing.

Work Experience

Lead Machine Learning Engineer - ByteDance (Singapore)

Apr 2024 - Present

  • Lead AI-driven content risk control for TikTok LocalService, implementing cutting-edge LLM and GraphSAGE models.
  • Developed spam perception algorithms (Leiden clustering, content classification) to detect fraudulent content.
  • Built real-time behavioral sequence models, reducing fraud/spam activities by 90%.
  • Fine-tuned multiple LLMs with LoRA (Qwen2.5/Gemma3/Llama3.2) for content violation prediction, reducing violative POI reviews by 60% at launch.
  • Planned and executed long-term technical strategy for overall TikTok LocalService risk control.
  • Collaborated with cross-functional teams (XFN) to design anti-fraud policies and risk mitigation strategies.
  • Mentored junior/senior data scientists, fostering AI innovation and knowledge sharing.

Senior Data Scientist - ByteDance (Singapore)

Nov 2021 - Apr 2023

  • Developed and deployed high-precision fraud detection ML models, reducing risk in the TikTok Live ecosystem by 70%+.
  • Designed real-time fraud detection pipelines to monitor and flag suspicious activity with >90% accuracy.
  • Automated behavioral sequence modeling using deep learning (LSTM, Transformers) to detect scam patterns.
  • Partnered with Trust & Safety teams to prevent fraudulent activities across TikTok’s global ecosystem.
  • Trained and mentored new hires in advanced ML techniques and AI deployment.

Data Science Strategist - Facebook (Singapore)

Nov 2019 - Jun 2021

  • Led APAC network planning initiatives, optimizing mobile tower placement and fiber routing using state-of-the-art AI/ML solutions.
  • Built automated data pipelines to generate network performance reports for global telecom partners.
  • Developed ML-based predictive models for infrastructure expansion and network optimization.

Data Scientist - AIA (Malaysia)

Oct 2018 - Oct 2019

  • Productionized fully automated ML models for repurchase propensity and lead recommendation.
  • Applied NLP and text mining to analyze call center data, extracting key insights for business decisions.
  • Automated geocoding and location-based analytics to improve agent allocation efficiency.
  • Mentored and assisted junior data scientists on various innovation projects.

Network Data Analyst - Digi Telecommunications (Malaysia)

Sep 2016 - Sep 2018

  • Built churn and NPS prediction models to maximize ROI on network investments.
  • Developed network congestion forecasting models, improving customer experience by 20%.

Regional RF Engineer - Maxis (Malaysia)

Apr 2014 - Sep 2016

  • Used geospatial analytics to reduce customer complaints by 25% through network optimization.
  • Deployed mobile coverage solutions for major events such as Formula 1 and MotoGP.

Key Machine Learning Projects

TikTok - Video POI Relevancy Prediction

  • Developed a video content relevancy model to detect violative POI videos.

TikTok - POI Reviews Violation Prediction

  • Fine-tuned LLMs with LoRA (Qwen2.5/Gemma3) for content policy violation prediction, leveraging state-of-the-art LLMs for real-time inference and defence.

TikTok - Order Reviews Relevancy and Sentiment Analysis

  • Fine-tuned LLMs with LoRA (Gemma2) for sentiment classification and content relevance scoring to improve review analysis.

TikTok - User Behavioral Sequence Model

  • Developed LSTM/Transformer models to detect suspicious activity in TikTok Live, achieving 90%+ fraud prevention accuracy.

TikTok - Spam Classification Model

  • Built XGBoost models to classify spam chats, enforcing real-time moderation.

Marketing - Upsell Campaign Takeup Prediction Model

  • Created gradient-boosted models (GBDT) for campaign targeting, optimizing user engagement.

Telco - Customer Churn Prediction Model

  • Developed churn propensity models leveraging user behavior and network data.

Call Center/Email - NLP: Topic Modeling, Sentiment Analysis

  • Applied BERT models for sentiment analysis, topic modeling, and automated summarization of support cases.

Skills

Data Science & Analytics

  • Python/R/ORE/SQL/Hive
  • Advanced Data Analytics
  • Machine Learning
  • PySpark/MLlib
  • Deep Learning (TensorFlow/PyTorch)
  • NLP/LLM/Transformers
  • Knowledge Graph
  • Kafka
  • vLLM/SGLang/Unsloth
  • Tableau/Qlik
  • RPA
  • Git/Version Control
  • Project Management
  • AWS/GCP/Oracle Cloud/Heroku
  • Recommender System

Education

  • Bachelor of Engineering in Electronics & Comm. System
    Australian National University
    2011 - 2012

Languages

  • English (Fluent)
  • Mandarin (Native)

Interests

  • Data Analytics
  • Data Science
  • Automation
  • Robotics
  • Artificial Intelligence
  • Open Source Development