About Fake Job Detector India

Learn why we built this tool, how the AI works, and our promise to Indian job seekers.

🎯 Why we built this

Every single day, millions of Indians receive fake job offers on WhatsApp, email, Naukri, and LinkedIn. Scammers impersonate well-known companies like Infosys, TCS, Wipro, and even government ministries. They promise high-paying roles and then ask victims to pay a "registration fee," "security deposit," or "training cost" β€” often ranging from β‚Ή500 to β‚Ή50,000.

These scams target India's most vulnerable job seekers β€” freshers, people from small towns, those recently laid off, and anyone desperate for income. According to government cybercrime data, job fraud is one of the fastest-growing categories of online scams in India, with thousands of complaints filed every month.

Fake Job Detector India was built to give every job seeker β€” regardless of technical knowledge β€” a free, instant, and reliable way to verify any job post before responding, paying, or sharing personal information. We wanted a tool trained specifically on Indian job data, not just generic global datasets.

πŸ€– How the AI works

At the core of Fake Job Detector India is a Random Forest machine learning classifier. Random Forest is an ensemble method that builds hundreds of decision trees during training and combines their predictions for higher accuracy and robustness than any single decision tree.

Job post text is first converted into numerical features using TF-IDF (Term Frequency–Inverse Document Frequency) vectorization. TF-IDF identifies which words and phrases are especially significant in distinguishing fake jobs from real ones β€” for example, terms like "registration fee," "weekly payment," "WhatsApp only," and "no experience needed" score very high in fake job posts.

On top of the ML model, we apply a layer of India-specific rule-based detection that catches local scam patterns the global training data might miss: unrealistically high salary figures in rupees, mentions of specific fake IT company names, requests for Aadhaar or PAN upfront, and WhatsApp-only contact patterns.

The final result combines the ML probability score with the rule-based flags into a single Risk Score (0–100) and a verdict: Safe, Suspicious, or High Risk.

Random Forest
ML classifier
TF-IDF
Text vectorization
scikit-learn
ML framework
Flask
Python backend
EMSCAD
Global dataset
Naukri India
Indian dataset

πŸ“Š Model accuracy and training data

Our model was trained on a combined dataset of 96,674 labeled job postings:

  • EMSCAD dataset β€” 17,880 global job posts labeled as real or fake, widely used in academic fake-job detection research
  • Naukri India dataset β€” 78,794 real Indian job postings from Naukri.com, providing deep coverage of the Indian job market vocabulary and patterns

The model achieves the following metrics on the held-out test set:

99.67%
Classification accuracy
0.7789
F1 score (fake class)
96,674
Training samples
100+
India-specific rules

A note on the F1 score: the fake-job class is inherently imbalanced (real jobs outnumber fake ones). An F1 of 0.7789 on the fake class means our model correctly identifies most fake jobs while keeping false positives low β€” a practical balance for real-world use.

πŸ”’ Our privacy promise

  • We never store the job post text you submit for analysis
  • We never require registration, login, or any personal information
  • We do not sell any user data to third parties
  • We use Google Analytics to understand aggregate traffic, but it does not identify individuals
  • The tool is completely anonymous β€” not even your IP is logged for job submissions
  • You can read our full Privacy Policy here

πŸ‘€ About the developer

Fake Job Detector India was built by Amit Mastud, an independent developer passionate about using machine learning to solve real problems that affect everyday Indians. The project started as a personal initiative after observing how many people in his network had received β€” and sometimes fallen victim to β€” fake job scams on WhatsApp.

The project is open source. All code, including the training pipeline and Chrome extension, is available on GitHub. Contributions, suggestions, and bug reports are welcome.

πŸ“£ How to report a fake job

If you've found a fake job post β€” whether our tool flagged it or you noticed it yourself β€” please report it to:

  • National Cybercrime Reporting Portal: cybercrime.gov.in
  • National Cybercrime Helpline: 1930
  • The platform where you found it (Naukri, LinkedIn, Indeed all have reporting options)
  • Your local police cyber cell if you have already been defrauded

Reporting fake jobs helps authorities track fraudsters and protects other job seekers. It takes less than 5 minutes.