Skip to content

FLN (Flintshire) scraper failing — server unreachable (TCP timeout) #350

Description

@symroe

Error (2026-06-11)

exceptions.ConnectionError: wreq::Error { kind: Connect, source: timed out }
Duration: 30s

Investigation

The scraper target https://cyfarfodyddpwyllgor.siryfflint.gov.uk (Flintshire's Welsh-language ModGov portal) is unreachable — TCP connections time out after 30 seconds. Direct curl requests to the endpoint also time out (exit code 28), confirming this is not a wreq-specific issue but a genuine network-level block or server downtime.

The scraper already has verify_requests = False set, so this is not a certificate issue.

Flintshire's ModGov instance appears to be blocking connections from Lambda's IP range, or the server itself is down.

Fix patterns ruled out

  1. HTTPS migration — already HTTPS
  2. verify_requests = False — already set
  3. http_lib = "playwright" — TCP timeout occurs before any HTTP exchange; playwright cannot help when the TCP layer is blocked
  4. URL changes — no evidence the URL has changed; the Welsh-language portal is the known address

What needs to happen

Check whether cyfarfodyddpwyllgor.siryfflint.gov.uk is temporarily down or has been decommissioned. If Flintshire has moved their ModGov instance, the base_url in metadata.json needs updating. If it's an IP block, use_proxy = True may be needed.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions