About — Fake Job Detector India

🎯 Why we built this

Every single day, millions of Indians receive fake job offers on WhatsApp, email, Naukri, and LinkedIn. Scammers impersonate well-known companies like Infosys, TCS, Wipro, and even government ministries. They promise high-paying roles and then ask victims to pay a "registration fee," "security deposit," or "training cost" — often ranging from ₹500 to ₹50,000.

These scams target India's most vulnerable job seekers — freshers, people from small towns, those recently laid off, and anyone desperate for income. According to government cybercrime data, job fraud is one of the fastest-growing categories of online scams in India, with thousands of complaints filed every month.

Fake Job Detector India was built to give every job seeker — regardless of technical knowledge — a free, instant, and reliable way to verify any job post before responding, paying, or sharing personal information. We wanted a tool trained specifically on Indian job data, not just generic global datasets.

🤖 How the AI works

At the core of Fake Job Detector India is a Random Forest machine learning classifier. Random Forest is an ensemble method that builds hundreds of decision trees during training and combines their predictions for higher accuracy and robustness than any single decision tree.

Job post text is first converted into numerical features using TF-IDF (Term Frequency–Inverse Document Frequency) vectorization. TF-IDF identifies which words and phrases are especially significant in distinguishing fake jobs from real ones — for example, terms like "registration fee," "weekly payment," "WhatsApp only," and "no experience needed" score very high in fake job posts.

On top of the ML model, we apply a layer of India-specific rule-based detection that catches local scam patterns the global training data might miss: unrealistically high salary figures in rupees, mentions of specific fake IT company names, requests for Aadhaar or PAN upfront, and WhatsApp-only contact patterns.

The final result combines the ML probability score with the rule-based flags into a single Risk Score (0–100) and a verdict: Safe, Suspicious, or High Risk.

Random Forest

ML classifier

TF-IDF

Text vectorization

scikit-learn

ML framework

Flask

Python backend

EMSCAD

Global dataset

Naukri India

Indian dataset

📊 Model accuracy and training data

Our model was trained on a combined dataset of 96,674 labeled job postings:

EMSCAD dataset — 17,880 global job posts labeled as real or fake, widely used in academic fake-job detection research
Naukri India dataset — 78,794 real Indian job postings from Naukri.com, providing deep coverage of the Indian job market vocabulary and patterns

The model achieves the following metrics on the held-out test set:

99.67%

Classification accuracy

0.7789

F1 score (fake class)

96,674

Training samples

100+

India-specific rules

A note on the F1 score: the fake-job class is inherently imbalanced (real jobs outnumber fake ones). An F1 of 0.7789 on the fake class means our model correctly identifies most fake jobs while keeping false positives low — a practical balance for real-world use.

🔒 Our privacy promise

We never store the job post text you submit for analysis
We never require registration, login, or any personal information
We do not sell any user data to third parties
We use Google Analytics to understand aggregate traffic, but it does not identify individuals
The tool is completely anonymous — not even your IP is logged for job submissions
You can read our full Privacy Policy here

👤 About the developer

Fake Job Detector India was built by Amit Mastud, an independent developer passionate about using machine learning to solve real problems that affect everyday Indians. The project started as a personal initiative after observing how many people in his network had received — and sometimes fallen victim to — fake job scams on WhatsApp.

The project is open source. All code, including the training pipeline and Chrome extension, is available on GitHub. Contributions, suggestions, and bug reports are welcome.

📣 How to report a fake job

If you've found a fake job post — whether our tool flagged it or you noticed it yourself — please report it to:

National Cybercrime Reporting Portal: cybercrime.gov.in
National Cybercrime Helpline: 1930
The platform where you found it (Naukri, LinkedIn, Indeed all have reporting options)
Your local police cyber cell if you have already been defrauded