card file · JAI-20 ai conversations / ai operations card 20 of 73
Reliability card
category: ai conversations · section: ai operations · page: /card/reliability

Reliability

how you make sure ai delivers reliable, accurate output

AI output can look confident and still be wrong. This card is about how the newsroom checks that what AI produces is accurate enough to use, and where errors could slip through. A team pauses here because in journalism a wrong fact carries a real cost.

Questions to explore

// use these as prompts in a workshop or on your own. There are no right answers.
  1. How do you check that AI output is accurate before it reaches your audience?
  2. Where in your workflow is an AI mistake most likely to go unnoticed?
  3. What level of accuracy is good enough for each way you use AI?
  4. How do you tell the difference between output that sounds right and output that is right?
  5. Who is responsible for catching errors, and do they have time to do it?

Expert voices

// notes from the journalists and AI experts who helped shape this kit

“AI technology is not trustworthy by default, and LLMs can make things up. How do we check the accuracy of AI systems and verify their results?”

Steffen Leidel, DW Akademie

“AI systems confidently produce false information: fabricated quotes, invented sources, non-existent studies. Journalists must understand why this happens to avoid publishing AI-generated misinformation.”

Lynn Khellaf, DW

“The machine is guessing. It is a formulation machine, not an analysis or fact machine, and testing its reliability is difficult and needs to happen over time.”

Peter Deselaers, DW Akademie

“AI produces statistically convincing lies that mix real and fake information. It shows you what looks right statistically, not what is true, so treat every AI-generated fact or quote as unverified until checked against a primary source.”

Mwende Mukwanyaga, AI Salon

Things to consider

  • AI can state something false with the same confidence as something true.
  • The risk of an error depends on what the output is used for.
  • A check that is skipped under deadline pressure is not a reliable check.

Using this card

Pull the Reliability card when it is relevant and set it aside when it is not. Pair it with the other AI Conversations cards, lay them out on a table, and use the questions above to get everyone on the same page. Capture what you discuss on sticky notes or in a shared doc.
