Skip to content

KatrielMoses/MailAccess

Repository files navigation

███╗   ███╗ █████╗ ██╗██╗      █████╗  ██████╗ ██████╗███████╗███████╗███████╗
████╗ ████║██╔══██╗██║██║     ██╔══██╗██╔════╝██╔════╝██╔════╝██╔════╝██╔════╝
██╔████╔██║███████║██║██║     ███████║██║     ██║     █████╗  ███████╗███████╗
██║╚██╔╝██║██╔══██║██║██║     ██╔══██║██║     ██║     ██╔══╝  ╚════██║╚════██║
██║ ╚═╝ ██║██║  ██║██║███████╗██║  ██║╚██████╗╚██████╗███████╗███████║███████║
╚═╝     ╚═╝╚═╝  ╚═╝╚═╝╚══════╝╚═╝  ╚═╝ ╚═════╝ ╚═════╝╚══════╝╚══════╝╚══════╝

License: MIT Python 3.11+ Docker PyPI version PyPI Downloads

Self-hostable OSINT platform for investigating email addresses. Fan out across breach databases, social networks, DNS records, and the open web — get back a unified exposure score and structured findings you can export or pipe into Maltego.

Built for security researchers, OSINT analysts, and penetration testers operating under authorization. Read DISCLAIMER.md before use.

Install

CLI only (no Docker)

pip install mailaccess

# Option A: auto-start (simplest)
mailaccess investigate you@example.com
# Server starts automatically, runs investigation,
# stops when done.

# Option B: keep server running
mailaccess serve  # in one terminal
mailaccess investigate you@example.com  # in another

# Option C: full stack with Web UI
git clone https://github.com/YOUR_USERNAME/mailaccess
docker compose up -d

Quick Start

mailaccess investigate you@example.com
mailaccess investigate you@example.com -o report.pdf
mailaccess investigate you@example.com --format jsonl
mailaccess investigate -                        # read email from stdin
mailaccess serve                                # start backend server on :8000
mailaccess keys list
mailaccess keys set HIBP_API_KEY your-key-here
mailaccess modules
mailaccess platform-health                      # inspect noisy/failing probes

# Enable specific opt-in modules for one run
mailaccess investigate email -m breach_deep
mailaccess investigate email -m all

Investigation demo

What It Does

  • Identity graph — cross-platform correlation of accounts, usernames, and signals from each investigation
  • Name Consensus Engine — confirms real identity from multiple independent name signals with confidence scoring
  • Defender's Brief — security-manager-ready risk summary with actionable findings and next step
  • Phone number recovery — pipeline to surface and validate numbers tied to the target
  • Telegram / WhatsApp hints — lightweight messaging-app footprint checks alongside other modules
  • YAML-driven platform system — social-style checks defined in backend/platforms/; community extensible without new Python for each site
  • Native Maigret engine — 2500+ platform coverage without a Maigret runtime dependency, including regional, niche, and international platforms not covered by WMN
  • Catch-all detection — excludes platforms that return false positives for arbitrary usernames before the sweep starts
  • Platform deduplication — merges WMN and Maigret results by profile URL domain so confirmed platforms are not double-counted
  • Avatar perceptual hashing — confirms the same person across platforms by image similarity
  • False-positive reduction — common-name filtering, content-length sanity checks, and multi-language failure signals reduce noisy hits
  • Disposable email detection — flags disposable domains and downweights affected enumeration findings
  • Bio fuzzy similarity — links profiles with materially similar free-form bios
  • Platform health — automatically demotes unreliable platforms and self-heals as reliability changes
  • Platform audit CLI — inspect platform reliability with mailaccess platform-audit
  • Temporal clustering and shadow profiles — surfaces coordinated signup windows and same-name accounts tied to alternate emails
  • Deep breach mode — checks top 100 highest-severity breached sites for account existence
  • Historical intelligence — Wayback Machine archive search + GitHub commit author search
  • Recursive email discovery — recovers other emails owned by the same person via name correlation
  • Credential Risk Score — separate 0-100 credential risk signal with LOW / MODERATE / HIGH / CRITICAL banding, top drivers, and recommended next steps
  • Concurrent module execution — all modules run in parallel, results stream as they arrive
  • WebSocket streaming — partial results arrive in real time without polling
  • REST API + web UI + CLI — use whatever interface fits your workflow
  • Plugin module system — drop a .py file in backend/modules/ and it auto-registers; no wiring required
  • 6 export formats: JSON, CSV, PDF, Markdown, STIX 2.1, Maltego XML
  • Maltego local transform server — run investigations directly from the Maltego desktop app
  • Webhook notifications — Slack, Discord, or any HTTP endpoint
  • Exposure score (0–100) with risk label: low / medium / high / critical
  • SQLite by default; PostgreSQL optional via Docker Compose profile

Modules

Module Coverage Key Required Opt-in
gravatar Profile hash lookup No No
hibp Breach check Yes No
breach_deep Probes top 100 highest-severity breached sites for account existence No (HIBP corpus fetched automatically) Yes
emailrep Reputation + blacklist No No
hudson_rock Infostealer logs (free) No No
google_dork 5 automated dorks Yes (SerpAPI) No
email_discovery Recovers other email addresses owned by same person via name dorks Yes (SERPAPI_KEY) No
domain_intel Domain + Shodan No (Shodan optional) No
dns_lookup MX/SPF/DMARC/DKIM/A/NS extraction No No
whois_lookup Domain WHOIS, privacy detection No No
wayback Finds historical pages where email appeared publicly via Wayback Machine CDX No No
github_commits Finds repos committed to with this email, surfaces real name from git config. Requires GITHUB_TOKEN for commit search; user profile search works without token. No (GITHUB_TOKEN optional, required for commit search) No
pgp_keyserver PGP key UID name lookup No No
orcid_lookup ORCID researcher identity No No
hackernews HackerNews profile name No No
sec_edgar SEC EDGAR filing contact extraction No No
companies_house UK Companies House officers Yes (COMPANIES_HOUSE_API_KEY, free) No
press_intel Press release contact extraction No Yes
xposedornot Default-on direct email-to-breach corpus lookup with breach names, data classes, and risk indicators No No
leakcheck Default-on public breach corpus lookup with regional coverage and stealer routing No No
ransomware_intel Default-on domain victim correlation against ransomware lists; skips free providers No No
social 13 platforms via YAML No No
social_links Username extraction, feeds pivot No No
account_discovery Holehe 120+ platforms No Yes
user_scanner 205+ platform vectors No Yes
whatsmyname 700+ platforms No Yes
maigret_platforms Native Maigret platform engine, 2500+ platforms No No (disable via ENABLE_MAIGRET_PLATFORMS=false)
sherlock_platforms Sherlock native engine, ~300 platforms No No
nexfil_platforms Nexfil native engine, ~300 platforms No No
blackbird_platforms Blackbird native engine, social focus No No
breachdirectory 2nd breach source Yes No
username_pivot WMN via recovered usernames No Yes
permutation_discovery 60 email variants No Yes
phone_intel Phone validation + WA/TG hints No No
messaging_hints Telegram/WhatsApp username check No No
ghunt Gmail deep intel No (setup required) Yes
identity_graph Cross-platform cluster analysis No No (automatic)
platform_health Persistent probe health, fragility, and skip decisions No No (automatic)
temporal_cluster Coordinated account-creation windows No No (automatic)
shadow_profiles Same-name accounts tied to alternate emails No No (automatic)
avatar_clusters Cross-platform perceptual-avatar clusters No No (automatic)
breach_corpus Cached and severity-ranked public HIBP breach catalog No No (used by breach_deep)
common_names Common-name and username false-positive controls No No (automatic)
disposable_domains Disposable-email confidence controls No No (automatic)

55 modules · 2500+ platforms by default

Platform Coverage

MailAccess checks usernames derived from the target email across multiple platform databases:

Source Platforms Default
WhatsMyName 700+ On
Holehe 120+ On
user-scanner 205+ On
Maigret native engine 2500+ On
Sherlock native ~300 On
Nexfil native ~300 On
Blackbird native social focus On

Total with Maigret enabled: 2500+ unique platforms after deduplication.

Enable Maigret:

ENABLE_MAIGRET_PLATFORMS=true mailaccess investigate email

Enable Maigret + Wave 2, the slower platform sweep:

ENABLE_MAIGRET_PLATFORMS=true ENABLE_MAIGRET_WAVE2=true mailaccess investigate email

The platform database is fetched from Maigret's GitHub repository (MIT licensed) and cached locally for 24 hours. Custom platforms can be added to data/mailaccess-extra-sites.json in the same format.

Findings from WMN and Maigret are deduplicated by URL domain. When both tools confirm the same platform, the finding is marked dual-confirmed with high confidence.

Variable Module Key Required Default Description
ENABLE_MAIGRET_PLATFORMS maigret_platforms None false Enable 2500+ platform sweep. Adds ~35-90s.
ENABLE_MAIGRET_WAVE2 maigret_platforms (Wave 2) None false Enable slow/fragile platform sweep. Requires ENABLE_MAIGRET_PLATFORMS=true. Adds ~90-150s.
MAIGRET_FORCE_{PLATFORM} maigret_platforms None (unset) Override auto-demotion for one platform.
MAILACCESS_SHARE_HEALTH platform-health None false Opt in to anonymized health sharing; sharing still requires --share.
DOMAIN_CLUSTER_CAP domain_cluster None 20 Maximum domains checked per infrastructure cluster pass.

Identity Graph

Every investigation generates a cross-platform identity graph linking accounts by shared usernames, photos, display names, and breach data. View at:

/investigation/:id/graph

Export as D3-compatible JSON via GET /api/report/{id}/graph or fetch clusters with confidence scores via GET /api/report/{id}/clusters.

Findings are automatically grouped into identity clusters with confidence scoring. Use --show-collisions to expand low-confidence matches in CLI output.

Name Consensus Engine

MailAccess collects name signals from every module that returns profile data: GitHub, Gravatar, Keybase, PGP keys, ORCID, LinkedIn, git commits, and more. The Name Consensus Engine synthesizes those signals into a single defensible output:

CONFIRMED IDENTITY
  Name:     Katriel Moses  [CONFIRMED]
  Sources:  GitHub · Gravatar · Keybase · PGP
  Reasoning: 4 independent sources agree.

Confidence bands:

  • Confirmed: 3+ independent sources, score >= 2.5
  • Probable: 2+ sources, score >= 1.5
  • Possible: single source, score >= 0.5
  • Unknown: no reliable name signals

Role/system email addresses (noreply@, admin@, support@, info@, and similar) are automatically detected and skipped.

Defender's Brief

Every investigation includes a Defender's Brief: a 30-second risk summary designed for security managers, not just analysts.

DEFENDER'S BRIEF
  Risk:    CRITICAL
  Summary: Active infostealer infection detected.
  1. Active credential theft   [CRITICAL]
     Infostealer detected via Hudson Rock.
     -> Rotate credentials immediately.
  2. Email in 8 breaches       [HIGH]
     Spanning 2012-2024.
     -> Audit password reuse.
  3. Real identity confirmed   [HIGH]
     John Doe - 2 independent sources.
     -> Review public profile exposure.
  Next action: Immediately rotate credentials and enforce hardware MFA.

Suppress it with --no-brief.

Historical Intelligence

MailAccess searches the Wayback Machine CDX API for archived pages where the email appeared publicly — catching deleted blog posts, old forum signatures, and removed contact pages.

GitHub commit history is searched by author email, revealing repos contributed to, real name from git config, and development activity timeline.

Deep Breach Mode

Enable with ENABLE_BREACH_DEEP=true.

Fetches the full HIBP breach corpus on startup, ranks sites by severity (record count × data class multipliers), then probes the top 100 highest-severity sites for account existence via YAML probes and generic reset-flow inference. Findings show breach name, record count, data classes, and severity — giving analysts a probabilistic credential exposure estimate.

Example output:

⚠ adobe.com    CRITICAL  153M records
  [Passwords, Email, Password hints]
✓ dropbox.com  HIGH       69M records
  [Email, Passwords]
~222M records across 2 breaches potentially include this email's credentials

Pipeline

MailAccess is pipeline-friendly: read target emails from stdin, stream JSONL output, and branch on exit codes in CI/CD scripts.

# Batch from file
cat emails.txt | mailaccess investigate -

# Stream JSONL
mailaccess investigate you@example.com --format jsonl | jq .

# Filter critical findings
mailaccess investigate you@example.com --format jsonl | jq 'select(.severity=="critical")'

Exit codes: 0 clean · 1 findings · 2 breaches · 3 error

See docs/integrations.md for GitHub Actions examples.


Adding a Platform

No Python required. Drop a YAML file in backend/platforms/:

cp backend/platforms/TEMPLATE.yaml backend/platforms/mysite.yaml

Edit fields, submit PR.

See CONTRIBUTING.md for full guide.

Export Formats

Format ?format= value Use case
JSON json Programmatic use, archiving
CSV csv Spreadsheet analysis
PDF pdf Human-readable reports
Markdown markdown Wikis, issue trackers
STIX 2.1 stix Threat intelligence platforms
Maltego XML maltego Maltego graph import

Integrations

Integration How
Maltego Local transform server at POST /maltego/email_investigate (no API key required)
Slack Set SLACK_WEBHOOK_URL in .env
Discord Set DISCORD_WEBHOOK_URL in .env
Generic webhook INTEGRATION_WEBHOOK_URL + optional INTEGRATION_WEBHOOK_SECRET (HMAC)

Self-Hosting

cp .env.example .env      # all API keys are optional
docker compose up         # backend :8000  ·  frontend :3000

Open http://localhost:3000 in your browser. Full setup guide: docs/self-hosting.md.

Platform Health & Self-Healing

MailAccess tracks probe outcomes for every platform it touches in ~/.mailaccess/platform_health.db. Phase 6C makes that data visible; Phase 6D makes it actionable. After every investigation, the system:

Note: mailaccess platform-audit shows platforms that have been probed in your local investigations. This number grows over time. The full platform database (2500+) is checked during every investigation regardless of how many appear in the health DB.

  • Auto-skips platforms with > 70% inconclusive probes over the last 30 days (only when those probes were collected in the last 14 days — stale stats never trigger a re-skip).
  • Auto-demotes Wave-1 platforms with > 40% inconclusive probes to Wave 2 for the current investigation.
  • Auto-upgrades Wave-2 platforms with < 10% inconclusive probes (and recent activity) back into Wave 1.

Every auto-action writes a JSONL entry to ~/.mailaccess/platform_demotion.log with timestamp, platform, action, reason, stats, and the env-var override hint.

Platforms are classified as:

  • KEEP — reliable; no action needed
  • WATCH — insufficient data
  • DEMOTE — noisy; moved to Wave 2
  • SKIP — sustained failure; excluded

Inspecting auto-actions

mailaccess platform-audit                 # show SKIP / DEMOTE / KEEP / WATCH per platform
mailaccess platform-audit --recommend-skip   # only SKIP candidates, with override instructions
mailaccess platform-audit --show-demotions   # only platforms that were auto-demoted, with stats + override env-var hints
mailaccess platform-health                # browse raw per-platform stats

Overriding auto-demotion

Every auto-demotion is reversible via env var. The mapping rule: take the platform name, strip non-alphanumeric characters, uppercase it, and prefix with MAIGRET_FORCE_. So NoisySite.com becomes MAIGRET_FORCE_NOISYSITECOM.

For example, override GitHub with MAIGRET_FORCE_GITHUBCOM=true.

# Run with NoisySite.com forced to its native wave, ignoring health stats:
MAIGRET_FORCE_NOISYSITECOM=true mailaccess investigate user@example.com

Community health sharing (opt-in)

You can contribute anonymized platform health stats to a public GitHub Gist:

mailaccess platform-health --share

This is strictly opt-in. The flag is the only way this code path executes — no background jobs, no scheduled tasks, no automatic upload. The payload contains platform-level metadata only (hit / miss / inconclusive rates, average latency, total probes, last probed). No user data, no email addresses, no investigation targets. Set MAILACCESS_SHARE_HEALTH=false in .env (the default) — the env var is documentation only; the CLI requires the explicit --share flag.

False Positive Control

MailAccess applies multiple layers before accepting platform-enumeration hits:

  • Absence strings are applied to every check type, including status-code checks.
  • A 200 OK response under 500 bytes is treated as inconclusive.
  • HTML entities are decoded before pattern matching.
  • Usernames matching the top-1000 common-name corpus are downweighted without corroboration.
  • Failure signals cover English, German, French, Spanish, and Portuguese.
  • Catch-all platforms that return hits for non-existent users are excluded.

See docs/fp-control.md for the full control model.

CLI Reference

Command Description
mailaccess investigate <email> Run a full investigation against an email address
mailaccess investigate - Read target email from stdin
mailaccess serve Start the backend server on :8000
mailaccess history List past investigations
mailaccess keys list Show all configured API keys
mailaccess keys set <KEY> <value> Set an API key
mailaccess keys unset <KEY> Remove an API key
mailaccess config set-url <url> Point the CLI at a MailAccess instance
mailaccess modules List all available modules
mailaccess commands List all CLI commands
mailaccess platform-health Inspect, export, or clear persistent platform probe health
mailaccess platform-health --share Opt-in: post anonymized platform health stats to a public Gist
mailaccess platform-audit Inspect platform reliability, ranked by noise rate
mailaccess platform-audit --show-demotions Show only platforms that were auto-demoted in the last 24h, with override env-var instructions
mailaccess platform-audit --recommend-skip Show only SKIP candidates, with MAIGRET_FORCE_<PLATFORM> override hints
mailaccess investigate <email> -m / --enable Enable opt-in modules for this run only. Comma-separated or all. Example: -m breach_deep,ghunt
mailaccess investigate <email> --no-brief Suppress Defender's Brief section

The --output / -o flag on investigate saves the report to a file. The extension determines the format: .json, .csv, .pdf, .md, .stix.json, .maltego.csv.

When a bare filename is given (no directory component), the file is written to the results/ directory automatically (e.g. -o report.jsonresults/report.json). This directory is git-ignored so investigation outputs are never accidentally committed. Absolute or relative paths that include a directory component (e.g. -o /tmp/report.json) are written as-is.

API Keys

Key Module Where to get it Required?
HIBP_API_KEY hibp https://haveibeenpwned.com/API/Key Yes (module skips without it)
SERPAPI_KEY google_dork https://serpapi.com Yes (module skips without it)
SHODAN_API_KEY domain_intel https://account.shodan.io No
EMAILREP_API_KEY emailrep https://emailrep.io No
HUNTER_IO_API_KEY hunter_io https://hunter.io No
GITHUB_TOKEN github_commits https://github.com/settings/tokens No (optional)
COMPANIES_HOUSE_API_KEY companies_house https://developer.company-information.service.gov.uk No (free forever, no CC)
SLACK_WEBHOOK_URL Webhooks https://api.slack.com/messaging/webhooks No
DISCORD_WEBHOOK_URL Webhooks Discord server settings No

Changelog

0.9.0

PHASE 1 — False Positive Killers

  • Absence strings applied to status_code checks
  • Content-length sanity check on 200 responses
  • HTML entity decoding before pattern matching
  • Common-name filter (1000-entry corpus)
  • Multi-language reset signals (DE/FR/ES/PT)
  • Disposable-domain detection (15k+ domains)

PHASE 2 — Reasoning Layer

  • Avatar perceptual hashing (imagehash, Hamming ≤ 5)
  • Bio fuzzy matching (RapidFuzz, 25 aggregators)
  • Persistent platform health (SQLite, rolling window)
  • Temporal clustering and shadow-profile detection
  • Name consensus: fuzzy merge, token_set_ratio, non-Western Unicode, temporal decay, and high-trust single-word names

PHASE 3 — Platform Expansion

  • Sherlock native engine (~300 platforms)
  • Nexfil native engine (~300 platforms)
  • Blackbird native engine (social focus)
  • SOURCE_PRIORITY dedup hierarchy

PHASE 4 — Output Hardening

  • Platform dedup strips 21 subdomain prefixes
  • Bio aggregator expanded from 7 to 25 domains
  • Credential risk uses YAML service categories
  • Breach normalizer polish

PHASE 6 — Feedback Loop

  • Name consensus: fuzzy, temporal, and Unicode handling (6A)
  • Domain clustering and shadow profiles V2 (6B)
  • Platform audit CLI: mailaccess platform-audit (6C)
  • Self-healing platform DB: auto-demotion, auto-upgrade, demotion log, and opt-in community sharing (6D)

ENGINE

  • Per-module asyncio timeout enforcement
  • Zombie investigation recovery on startup
  • Graph construction optimized for large datasets
  • All platform-health calls made async

0.8.1

  • maigret_platforms now default-on (2500+ platforms checked in every investigation)
  • Wave 2 remains opt-in via ENABLE_MAIGRET_WAVE2
  • ENABLE_MAIGRET_PLATFORMS=false to disable if investigation speed is a priority

0.8.0

  • Native Maigret platform engine: 2500+ platforms without Maigret runtime dependency
  • Two-wave architecture: Wave 1 is the fast default when enabled; Wave 2 adds slower and more fragile platforms
  • Catch-all detection: validates platforms against known-unclaimed usernames before sweep
  • Platform deduplication: WMN + Maigret merged by URL domain, dual-confirmed findings marked high confidence
  • Custom platform additions via data/mailaccess-extra-sites.json
  • ENABLE_MAIGRET_PLATFORMS env var, default false
  • ENABLE_MAIGRET_WAVE2 env var, default false
  • Platform database auto-refreshed every 24h from Maigret GitHub (MIT licensed)

0.7.0

  • Name Consensus Engine: synthesizes name signals from all profile modules into Confirmed/Probable/Possible/Unknown with reasoning and source list
  • Defender's Brief: risk-first output with top 3 actionable findings and concrete next step. Suppressed with --no-brief.
  • PGP keyserver: email to UID name lookup via keys.openpgp.org, weight 1.0, highest trust
  • ORCID: researcher identity lookup, institutional verified names, weight 0.95
  • HackerNews profile: name extraction from about field via Firebase and Algolia APIs
  • SEC EDGAR: phone/contact extraction from public filings for business domains, no key
  • Companies House UK: officer names and registered address, free key required
  • Press intel: press release contact extraction, opt-in via -m press_intel
  • WHOIS/RDAP phone extraction: surviving post-GDPR registrars now surface phone numbers
  • Role/system email detection: noreply@, admin@, support@, and similar addresses skip name inference automatically
  • Name shown in summary bar when confirmed/probable

0.6.5

  • QA pass: cosmetic label fixes, keybase 404 handling, WebSocket large payload fix
  • github_user, twitter_profile, linkedin_snippet display names corrected in identity clusters
  • Alias normalization original email now passed to all profile extraction modules
  • Timeline builder wired to all breach sources
  • Profile intelligence and PII findings in all export formats

0.5.3

  • Cluster identity analysis no longer shows raw traceback on timeout — shows dim fallback message instead
  • Hardcoded minimum timeout floors for pip-installed users: account_discovery 120s, username_pivot 60s, user_scanner 180s, whatsmyname 200s
  • .env overrides still win if set higher

0.5.2

  • Config resilience: CORS_ORIGINS and dict fields now accept plain strings, comma-separated values, and empty strings without crashing
  • No more SettingsError on first run with default .env
  • Startup confirmation line shows config parsed correctly

0.5.1

  • LeakCheck integration: free corpus lookup, covers CIS/regional breaches XposedOrNot misses
  • XposedOrNot paste signals surfaced separately from breach signals in CLI and summary bar
  • Ransomware domain victim correlation: checks email domain against ransomware victim lists (ransomware.live + ransomlook.io)
  • Summary bar now shows three-part breakdown: Breaches: X | Pastes: Y | Stealer: Z
  • LeakCheck stealer category correctly routed to stealer signal count not breach count
  • Removed legacy credential_risk: null from JSON export

0.5.0

  • XposedOrNot integration: free direct breach corpus lookup, no API key, default-on, closes ~70-80% of HIBP coverage gap
  • Breach normalizer: deduplicates breach findings across all sources into single canonical records with source attribution
  • Credential Risk Score: separate 0-100 score with band, top 3 score drivers, and recommended analyst actions. Infostealer hit forces CRITICAL. Surfaces in CLI, UI, all exports, and webhooks.

0.4.3

  • github_commits: returns PARTIAL (not FAILED) without GITHUB_TOKEN, includes setup hint
  • whois_lookup: IANA-managed domains now parse correctly, timezone-aware datetime fix, richer field extraction (organisation, nserver, registered, expires)

0.4.2

  • Default modules now run without any flags: whatsmyname, account_discovery, user_scanner, username_pivot, permutation_discovery, phone_intel, messaging_hints
  • -m / --enable flag for opt-in modules per run (breach_deep, ghunt, email_discovery)
  • -m all enables all three opt-in modules
  • Invalid -m module name shows helpful warning

0.4.1

  • Deep breach mode and email discovery improvements
  • Phone extractor false positive fixes carried forward

0.4.0

  • Deep breach mode: probes top 100 highest-severity breached sites for account existence (opt-in, ENABLE_BREACH_DEEP=true)
  • Name → email discovery: recovers other email addresses owned by same person via SerpAPI dorks (requires SERPAPI_KEY)
  • Wayback Machine: CDX search for historical pages where email appeared publicly
  • GitHub commit search: author-email search across all public commits, surfaces repos + real name from git config (GITHUB_TOKEN optional)
  • Breach corpus: auto-fetched from HIBP public API, severity-ranked by record count × data class multipliers, cached 24h

Troubleshooting

Troubleshooting demo

Links

Self-hosting guide Docker Compose, .env reference, PostgreSQL, proxy/Tor, Maltego setup
Module reference All modules, findings schema, adding new modules
False-positive controls Common-name, disposable-domain, clustering, health, and scoring controls
API reference REST endpoints, WebSocket events, authentication
Export formats Supported formats, MIME types, filename conventions
Integrations Maltego, Slack, Discord, generic webhooks
Contributing Adding modules, adding exporters, code style, PR checklist
PyPI pip install mailaccess
GitHub Source code, issues, releases

License

MIT. All data queried by MailAccess comes from public sources. See DISCLAIMER.md for authorized use cases and legal responsibility.