AI Impact Research

Comprehensive research on AI platform usage from frontier labs (2025)

View the Project on GitHub vishalsachdev/ai-impact

← AI Impact Research · AI Capabilities Research


Safety and Alignment

Research Question

How robust and safe are current AI models?

Hypothesis

AI safety characteristics vary significantly across models, with important implications for responsible deployment in educational contexts and AI ethics curricula.


Key Findings

1. Prompt Injection Resistance

Claude Opus 4.5: Industry-leading resistance to prompt injection attacks (Anthropic)

2. Jailbreak Resistance

Models vary in susceptibility to:

3. Hallucination Rates

Model Hallucination Rate Trend
Frontier models (2025) 5-15% Improving
Previous generation 15-25% -
Early LLMs 25-40% -

4. Safety-Capability Tradeoffs


Implications for Universities

AI Ethics Curriculum

Deployment Decisions

Research Ethics



Explore This Research


← Previous: Agentic Capabilities Next: Educational Implications →