Atomic DeFake

Description
Running the app
Guide
Things to note

Description

Prototype made by Shohail Ismail, Michiel van der Meer, and Alessio Xompero for the 'Idiap Research Institute - Create Challenge 2024' hackathon to detect misinformation in text posts. Uses Streamlit UI and Mistral API pipeline to generate atomic assessment questions for fact-checking, combining AI and human answers to aggregate final trustworthiness verdicts. Also provides feedback for iterative content improvement, configurable as an automated, corrective-feedback loop for scheduled posts. Next steps involve training on AVeriTeC for enhanced accuracy and explainability.

Running the app

Requirements

Anaconda
Python 3.11

Installation

conda create -n atomic-defake python=3.11

conda activate atomic-defake

pip install -r requirements.txt

Setup

Create .env file in the project root and put in your Mistral API key as follows:

MISTRAL_API_KEY=<YOUR_KEY>

Run

streamlit run ui.py

Guide

After running the app, a Streamlit interface will open in your browser. After logging in (credentials aren't needed), click 'User post' from the sidebar and input text to verify for misinformation, then click the 'AtomicDeFake' button.
You will then be taken to the 'Contributor' screen where you can answer AI-generated questions about the post and give a final confidence score in your answers.
- The reason for this is that AtomicDeFake works on a reciprocal crowdsourcing system, so to get your post verified, you must answer 5 questions on 2 different users' posts. Though for the sake of this demo, you will be answering the same questions on your own post twice.
After doing this, the page will run the aggregator, which combines AI analysis with what other people have said about your post, thus exhibiting an AI + Human-In-The-Loop (HITL) system.
If the post is deemed to contain misinformation, the human + AI responses are given, with a prompt to rewrite and resubmit your post, else the post is verified and deemed to be factual.

Things to note

The aggregator is precision-oriented and risk-averse, favouring false negatives over false positives (except certain edge cases detailed below). This means that two abstentions by contributors vetoes the verification regardless of AI responses. It also means that absolute quantifiers (always/never/zero) will often nudge the evaluation AI towards declaring the post false, which encourages nuance in language.
- TEST: consistent "I don't know" answers
- FUTURE: fine-tuning the model to better distinguish and setting up AI-overrule thresholds/consensus-forcing question loops for abstention
The question-generation AI operates at clause-level granularity so that false subclaims are not ignored in favour of more/larger true (sub)claims; in other words, it splits post text into atomic claims. This characteristic is also embodied by the truthfulness evaluation AI, owing to its risk-averse nature, meaning that one erroneous sub-claim voids the whole post.
- TEST: multi-claim text, e.g., "The sky is blue and clouds are green", “mRNA COVID-19 vaccines contain microchips and reduce severe disease risk”, etc.
- FUTURE: the current mechanism is fine, though feedback could be made more granular (per sub-claim).
The aggregator is currently primed to trust human responses implicitly over AI, which decreases distrust from public and risk of hallucinations swaying consensus. However, this also means that, if a clearly factual statement ("The sky is blue") is deemed by at least one of the human contributors to be false, it is voided as such - even if the AI and other human recognise the text as factual.
- TEST: treating factual questions as false, and answering the HITL questions as such.
- FUTURE: implement a form of reputation and disagreement audits to gain more information on reasons - if given reasons are weak, decrease reputation.

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
atomic_defake		atomic_defake
.gitignore		.gitignore
README.md		README.md
contributor.py		contributor.py
plot.py		plot.py
requirements.txt		requirements.txt
ui.py		ui.py
user_post.py		user_post.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Atomic DeFake

Description

Running the app

Requirements

Installation

Setup

Run

Guide

Things to note

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Atomic DeFake

Description

Running the app

Requirements

Installation

Setup

Run

Guide

Things to note

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages