Deepfakes & Misinformation
Lesson 8.4 — Deepfakes: Detection, Reality, and Media Literacy
The word "deepfake" entered common usage around 2017, when AI-generated face-swapping technology became accessible to people without specialist skills. Since then, the technology has advanced dramatically — and the range of harms it enables has grown with it. In 2024, AI-generated synthetic media is no longer a niche concern. It is a mainstream challenge that touches electoral politics, personal safety, fraud, and how we evaluate evidence.
This lesson covers what deepfakes are technically, how to detect them, the current reality of voice cloning, available detection tools, and the SIFT media literacy framework adapted for AI content.
What Deepfakes Are
"Deepfake" originally referred specifically to face-swapping technology (deep learning + fake), but the term now broadly covers:
- Video deepfakes: A person's face replaced with another's in video; a person made to appear to say something they didn't say
- Audio deepfakes: A synthetic voice cloned from a real person, capable of saying anything
- Image deepfakes: Generated images of real-looking but non-existent people, or manipulated images of real people
- Text deepfakes: AI-generated text designed to impersonate a specific person's writing style
The common thread is synthetic media designed to look, sound, or feel authentic.
Detection: Visual Tells
Deepfake detection is a moving arms race. As detection improves, generation improves, and vice versa. However, current deepfake video still has detectable tells in many cases:
| Tell | What to look for |
|---|---|
| Facial edge blurring | The boundary between the swapped face and neck/hair often has a subtle blurring or shimmer |
| Eye movement anomalies | Blinking patterns may be irregular; eyes may not track naturally |
| Inconsistent lighting | The face and the environment may have different light sources or shadows |
| Hair and fine detail | Individual hairs and teeth are notoriously difficult to synthesise convincingly |
| Unnatural skin texture | Skin may look unusually smooth or have a slight waxy quality |
| Head movement blur | Fast head movements often reveal glitching or warping at facial edges |
| Mouth sync errors | Even slight mismatches between mouth movement and audio are perceptible on close review |
| Reflection inconsistencies | Reflections in glasses or eyes may not match the scene correctly |
| Background anomalies | Backgrounds can morph or blur during movement in some systems |
| Jewellery and accessories | Earrings, glasses, and clothing details often behave oddly at frame edges |
Important caveat: These tells are decreasing with every generation of technology. The best current deepfakes are not reliably detectable by the human eye, particularly in compressed social media video. Do not rely entirely on visual inspection for high-stakes authentication.
Audio Deepfakes: The Real Situation
Voice cloning has become alarmingly accessible and convincing. Here is the honest picture:
What is currently achievable:
- A plausible voice clone can be created from as little as 30–60 seconds of audio using consumer tools (ElevenLabs, etc.)
- Higher quality clones benefit from more audio but are still possible with limited samples
- The clone can be made to say anything typed
- Cloned voices are often indistinguishable from real voices in short clips, especially in emotionally charged contexts (urgency, distress)
Documented fraud case (2024): A finance executive at a multinational company received a deepfake video call involving clones of his CEO and CFO, requesting a $25 million transfer. The call appeared to include multiple colleagues simultaneously. The executive was defrauded. Multiple similar cases involving voice cloning of family members ("virtual kidnapping" scams) have been documented in 2023–2024.
What this means practically:
- A phone call or voice message from a known person is no longer sufficient verification for a significant financial request or sensitive action
- Establish a "safe word" or verification question with close contacts for unexpected high-stakes requests
- Any unexpected request — even from a familiar voice — that involves urgency and money should be verified via a separately initiated call
Detection Tools
Several tools attempt to detect AI-generated content:
| Tool | What it detects | Accuracy | Notes |
|---|---|---|---|
| Hive Moderation | AI-generated images, video, text | High for AI images | API and web interface; used commercially |
| Sensity AI | Deepfake video and face swaps | High | Enterprise-focused; not free |
| Reality Defender | Multi-modal deepfake detection | High | Enterprise; used by news organisations |
| AI or Not | AI-generated images | Moderate | Free consumer tool |
| Deepware Scanner | Deepfake video | Moderate | Free, consumer-facing |
| GPTZero | AI-generated text | Moderate; many false positives | Used in education; accuracy debated |
Honest assessment of detection tools: No tool is 100% accurate. Most detection tools generate both false positives (flagging real content as fake) and false negatives (missing actual deepfakes). They are useful as one signal, not as definitive proof. The detection space is also rapidly evolving — tools that work well today may underperform as generation technology advances.
The SIFT Media Literacy Framework Applied to AI Content
SIFT was developed as a media literacy framework by researcher Mike Caulfield. Originally designed for news and web content, it applies directly to AI-generated and synthetic media.
S — Stop
Before reacting to or sharing content, stop. The moment of first reaction — surprise, outrage, delight — is the moment you are most vulnerable to being misled. Pause before doing anything.
I — Investigate the Source
Who is sharing this content? What do you know about them? Is this from a source you have prior reason to trust? An unfamiliar account, an urgent share from an emotional message, or content from a new platform warrant more scrutiny.
F — Find Better Coverage
Can you find the same claim or event from multiple independent sources? If a shocking video of a public figure exists, would it appear in credible mainstream news outlets? If the content is true and significant, independent sources will typically cover it.
T — Trace Claims, Quotes, and Media to the Original Context
The most important step for deepfakes. Reverse image search (Google Images, TinEye) can find the original context of an image. Video can often be traced to an original upload. Quotes can be traced to their source. A video clip shown out of context — even of a real event — can be profoundly misleading without being technically fake.
Applying SIFT: A Worked Example
You receive a WhatsApp message containing a video clip of a politician appearing to say something shocking. The video is shared by someone you know but the source is unclear.
S — Stop: Resist the impulse to share immediately. The emotional reaction (outrage) is exactly what makes this kind of content spread.
I — Investigate the source: Where did this video come from? What account originally posted it? Is there any visible context (news watermark, date, event)?
F — Find better coverage: Search the politician's name and the supposed content on news sites. If this were real, it would be front-page news. If no credible outlet has covered it, that is significant.
T — Trace the media: Download the video still and reverse image search it. Check when the clip was first uploaded online. Watch the video carefully for the visual tells described above.
Key takeaway: SIFT is not about becoming a forensic investigator for every piece of media. It is about slowing down the moment between seeing and sharing, and building a habit of sourcing before amplifying.
What Platforms and Governments Are Doing
- Meta, YouTube, and TikTok have policies requiring disclosure of synthetic media, particularly in political advertising
- The UK Online Safety Act includes provisions targeting non-consensual intimate deepfakes
- US legislation (No FAKES Act and others) is working through Congress to create legal liability for unauthorised deepfakes of real people
- C2PA (Coalition for Content Provenance and Authenticity) is developing technical standards for embedding verifiable provenance information into images and videos at creation
These measures are imperfect and enforcement is challenging, but they represent a meaningful response to a documented harm.
Practice Task
Do a deliberate media literacy exercise this week. When you encounter a striking image, video, or quote online — even from sources you generally trust — apply SIFT. Trace the original source of one piece of content. This builds a habit that will serve you well as synthetic media becomes increasingly prevalent.