Summary
I am a data professional with extensive experience in data analytics, data science, machine learning, artificial intelligence.
I am well versed in various data science essential programming languages such as Python, SQL, Pyspark and HiveQL.
I develop Machine Learning/Artificial Intelligence solution at scale to tackle various business challenges. (eg: spam/fraud detection, behavioural sequence learning, Risk LLM, video classification, image clustering, etc.)
Work Experience
Lead Machine Learning Engineer - Bytedance (Singapore)
Apr 2024 - Present
- Lead & develop all creator & content risk control effort for TikTok new business, LocalService.
- Tackle unique risk in Tiktok LocalService with SOTA algorithms and models. (LLM, GraphSage, etc)
- Build the foundation spam perception capabilities for LocalService. (Leiden community, video/text content clustering, etc)
- Plan and lead team to develop spam defence capabilities for LocalService. (Real time behaviour sequence, classification models, etc)
- Leveraged and fine tuned vLLM/LLM models for various use case (sentiment analysis, relevancy prediction, review rewritting, etc.)
- Collaborate with XFN to design policies and enforcement for fraudsters on Tiktok LocalService.
- Mentored and coached new junior/senior data scientist and MLE.
Senior Data Scientist - Bytedance (Singapore)
Nov 2021 - Apr 2023
- Build complex rules, algorithms and ML models to respond and mitigate business risk in TikTok Live ecosystem.
- Design risk control measurements and develop end to end automated pipeline for near realtime metric monitoring to enable quick response against spam/fraudster within TikTok Live ecosystem.
- Reduces TikTok Live ecosystem risk and fraud by >70% with a combination of multiple high precision ML models and risk control strategies.
- Develop and deployed near realtime behavioural sequence model that successfully mitigate >90% of scam cases on TikTok Live ecosystem.
- Collaborate with XFN such as Trust & Safety team to ensure business risk are not propogated outside of TikTok Live ecosystem.
- Mentored and coached new junior data scientist to ensure seamless onboarding process.
Data Science Strategist - Facebook (Singapore)
Nov 2019 - Jun 2021
- Lead partnership programs with mobile operator partners using data analytics and various tools that help them transform their networks and infrastructure.
- Lead APAC network planning go to market engagements, helping partners to deploy mobile cell towers or fiber efficiently.
- Build data pipelines that automates monthly analytics report for telco partners.
- Leverage ML prediction/classification models to support partner engagements.
Data Scientist - AIA (Malaysia)
Oct 2018 - Oct 2019
- Productionize multiple machine learning models. Fully automated model (re)training, validation and scoring.
- Leveraged text mining/NLP algorithms on call centre data to extract valueable insights.
- Automate address geocoding and developed innovative customer-to-agent allocation algorithm.
- Mentor and assist junior data scientist in various projects.
Network Data Analyst - Digi Telecommunications (Malaysia)
Sep 2016 - Sep 2018
- Develop smart network investment framework via collaboration with Marketing Data Science team. Build churn and NPS prediction model to maximize ROI of network investment and enables prioritization based on business needs.
- Implement new network optimization strategy and feature trials to ensure network performance is on par or better than competitors. Improves user download throughput by 20%.
- Collaborate with marketing Data Science team to develop innovate solutions such as traffic forecast, congesition prediction model that improves network investment decisions.
Regional RF Engineer - Maxis (Malaysia)
Apr 2014 - Sep 2016
- Utilized geospatial data analysis tools to pin point and reduce more than 25% of regional customer complain after new site, re-engineering and RF optimization efforts.
- Deploy mobile coverage vehicle (MCV) for special events such as Formula 1 and Moto GP and implemented necessary optimization mitigation strategy to achieve zero congestion throughout the event.
- Improved major highway coverage and quality to achieve zero drop call. Achieved regional lowest capacity hotspots after bi-sector upgrades, LTE integration and RF optimization.
Data Science Projects
TikTok - Order Reviews Relevancy and Sentiment Analysis
- Fine tuned foundation LLM model with order reviews to predict the sentiment and relevancy of reviews to merchant/POI.
TikTok - Video POI Relevancy Prediction
- Develop video content relevancy prediction model to ensure user posted content does not violate platform policy and relevant to POI location tagged. Leveraged vLLM and finetuned embedding models.
TikTok - User Behavioural Sequence Model
- Develop LSTM/Transformer deep learning model that predicts suspicious user on TikTok Live platform by learning on multiple series of user behaviour sequence embeddings. (eg: action sequence, time difference sequence, IP address sequence, etc.)
TikTok - Spam Classifaction Model
- Leverages user behaviour statistics on TikTok Live platform to develop XGB classification model that predict spam chats and enforce punishments on TikTok Live.
Marketing - Upsell Campaign Takeup Prediction Model
- Combining user demographics, purchase/payment/claim history, created a GDBT classification model that predicts campaign takeup propensity. Improve campaign team lead selection for upcoming campaign.
Telco - Customer Churn Prediction Model
- Combining user demographics, usage behaviour, user experience and mobile network performance, trained a GDBT classification model that predicts user churn propensity.
Call Center/Email - NLP : Topic Modeling, Sentiment Analysis
- Sentiment analysis, topic modeling/summarization based on call center case description & email content using BERT model.
- Automate keyword/address/coordinates extraction using NER from case & email content.
Skills
Data Science & Analytics
-
Python/R/ORE/SQL/Hive
-
Advanced Data Analytics
-
Machine Learning
-
Pyspark/MLib
-
Deep Learning (Tensorflow/Pytorch)
-
NLP/LLM/Transformers
-
Knowledge Graph
-
Kafka
-
Geospatial Analytics
-
Tableau/Qlik
-
RPA
-
Git/Version Control
-
Project Management
-
AWS/GCP/Oracle Cloud/Heroku
-
Recommender System
Education
-
Bachelor of Engineering in Electronics & Comm. SystemAustralian National University2011 - 2012
-
Diploma in Electrical & Electronics EngineeringInti College Subang Jaya2008 - 2010
Language
- English ( Fluent )
- Mandarin ( Native )
- Cantonese ( Fluent )
- Malay ( Professional Working )
Interests
- Data Analytics
- Data Science
- Automation
- Robotics
- Artificial Intelligence
- Open Source