Professional Summary
Results-driven Machine Learning Engineer and Data Scientist with extensive experience in AI, ML, and data analytics.
Skilled in building scalable ML models, developing LLM-based solutions, and deploying AI-driven risk mitigation strategies.
Proven ability to optimize fraud detection, behavioral analytics, and real-time AI solutions for large-scale platforms.
Proficient in Python, PyTorch, TensorFlow, SQL, AWS, and big data processing.
Work Experience
Lead Machine Learning Engineer - ByteDance (Singapore)
Apr 2024 - Present
- Lead AI-driven content risk control for TikTok LocalService, implementing cutting-edge LLM and GraphSAGE models.
- Developed spam perception algorithms (Leiden clustering, content classification) to detect fraudulent content.
- Built real-time behavioral sequence models, reducing fraud/spam activities by 90%.
- Fine-tuned multiple LLMs (Qwen2.5/Gemma3/Llama3.2) with LoRA for content violation prediction, reducing violative POI reviews by 60% at launch.
- Plan and execute long-term technical strategy for overall TikTok LocalService risk control.
- Collaborated with cross-functional teams (XFN) to design anti-fraud policies and risk mitigation strategies.
- Mentored junior/senior data scientists, fostering AI innovation and knowledge sharing.
Senior Data Scientist - ByteDance (Singapore)
Nov 2021 - Apr 2023
- Developed and deployed high-precision fraud detection ML models, reducing risk in the TikTok Live ecosystem by 70%+.
- Designed real-time fraud detection pipelines to monitor and flag suspicious activity with >90% accuracy.
- Automated behavioral sequence modeling using deep learning (LSTM, Transformers) to detect scam patterns.
- Partnered with Trust & Safety teams to prevent fraudulent activities across TikTok’s global ecosystem.
- Trained and mentored new hires in advanced ML techniques and AI deployment.
Data Science Strategist - Facebook (Singapore)
Nov 2019 - Jun 2021
- Led APAC network planning initiatives, optimizing mobile tower placement and/or fiber routing using state-of-the-art AI/ML solutions.
- Built automated data pipelines to generate network performance reports for global telecom partners.
- Developed ML-based predictive models for infrastructure expansion and network optimization.
Data Scientist - AIA (Malaysia)
Oct 2018 - Oct 2019
- Productionized fully automated ML models for repurchase propensity and lead recommendation.
- Applied NLP and text mining to analyze call center data, extracting key insights for business decisions.
- Automated geocoding and location-based analytics to improve agent allocation efficiency.
- Mentored and assisted junior data scientists on various innovation projects.
Network Data Analyst - Digi Telecommunications (Malaysia)
Sep 2016 - Sep 2018
- Built churn and NPS prediction models to maximize ROI on network investments.
- Developed network congestion forecasting models, improving customer experience by 20%.
Regional RF Engineer - Maxis (Malaysia)
Apr 2014 - Sep 2016
- Used geospatial analytics to reduce customer complaints by 25% through network optimization.
- Deployed mobile coverage solutions for major events such as Formula 1 and MotoGP.
Key Machine Learning Projects
TikTok - Video POI Relevancy Prediction
- Developed a video content relevancy model to detect violative POI videos.
TikTok - POI Reviews Violation Prediction
- Fine-tuned LLMs (Qwen2.5/Gemma3) with LoRA for content policy violation prediction, leveraging state-of-the-art LLMs for real-time inference and defence.
TikTok - Order Reviews Relevancy and Sentiment Analysis
- Fine-tuned LLMs (Gemma2) with LoRA for sentiment classification and content relevance scoring to improve review analysis.
TikTok - User Behavioural Sequence Model
- Developed LSTM/Transformer models to detect suspicious activity in TikTok Live, achieving 90%+ fraud prevention accuracy.
TikTok - Spam Classification Model
- Built XGBoost models to classify spam chats, enforcing real-time moderation.
Marketing - Upsell Campaign Takeup Prediction Model
- Created gradient-boosted models (GBDT) for campaign targeting, optimizing user engagement.
Telco - Customer Churn Prediction Model
- Developed churn propensity models leveraging user behavior and network data.
Call Center/Email - NLP: Topic Modeling, Sentiment Analysis
- Applied BERT models for sentiment analysis, topic modeling, and automated summarization of support cases.
Skills
Data Science & Analytics
- Python/R/ORE/SQL/Hive
- Advanced Data Analytics
- Machine Learning
- PySpark/MLlib
- Deep Learning (TensorFlow/PyTorch)
- NLP/LLM/Transformers
- Knowledge Graph
- Kafka
- vLLM/SGLang/Unsloth
- Tableau/Qlik
- RPA
- Git/Version Control
- Project Management
- AWS/GCP/Oracle Cloud/Heroku
- Recommender System
Education
- Bachelor of Engineering in Electronics & Comm. System, Australian National University, 2011 - 2012
Language
- English (Fluent)
- Mandarin (Native)
Interests
- Data Analytics
- Data Science
- Automation
- Robotics
- Artificial Intelligence
- Open Source Development
