
I built a game where domain experts try to break frontier AI
- Freemium
- Web
- Design
- AI Model Benchmarking & Evaluation
- Free plan available
- No credit card
What is I built a game where domain experts try to break frontier AI?
Key features
Submit expert questions
Create and submit domain-specific questions intended to expose AI model failures
Multi-model testing
Test your questions against several frontier AI models to find discrepancies
Failure verification
Platform reviews submissions to confirm genuine AI failures before payment
Payment for verified fails
Earn money when your questions successfully stump AI models and pass verification
Freemium access
Use basic features at no cost; paid tier likely offers additional benefits
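The multi-model testing feature above can be illustrated with a minimal sketch. This is not the platform's actual implementation, which is not documented here. It assumes each model is simply a callable that takes a question and returns an answer string, and it uses exact matching after normalization as a deliberately crude failure check. The names `find_discrepancies`, `normalize`, and the toy models are all hypothetical.

```python
from typing import Callable, Dict

# Hypothetical stand-in: a "model" is any callable mapping a question to an answer.
Model = Callable[[str], str]


def normalize(answer: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences are ignored."""
    return " ".join(answer.lower().split())


def find_discrepancies(question: str, models: Dict[str, Model], expected: str) -> dict:
    """Ask every model the same question and flag answers that miss the expected answer."""
    answers = {name: normalize(model(question)) for name, model in models.items()}
    target = normalize(expected)
    # A model "fails" here if its normalized answer differs from the expert's answer.
    failed = sorted(name for name, ans in answers.items() if ans != target)
    return {
        "answers": answers,
        "failed_models": failed,
        # True when at least two models gave different normalized answers.
        "models_disagree": len(set(answers.values())) > 1,
    }


# Toy models used only for illustration; real models would be API calls.
models = {
    "model_a": lambda q: "Paris",
    "model_b": lambda q: " paris ",
    "model_c": lambda q: "Lyon",
}
report = find_discrepancies("What is the capital of France?", models, "Paris")
print(report["failed_models"])
```

Exact matching only suits short, unambiguous answers. Open-ended expert questions would need human or rubric-based review, which is presumably the role of the failure verification step described above.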
Pros & cons
Advantages
- Practical contribution to AI safety and research by documenting real model limitations
- Straightforward way for experts to earn money using their existing domain knowledge
- Builds transparent documentation of where frontier AI actually fails, useful for practitioners and researchers
- No technical expertise required beyond domain knowledge in your field
Limitations
- Payment depends on verification, so not all submitted questions will generate income
- Limited to individuals with genuine expert knowledge; casual users are unlikely to succeed
- Verification process may be slow or have unclear criteria for what constitutes a 'failure'
Use cases
Medical professionals testing AI diagnostic tools for accuracy in clinical edge cases
Lawyers reviewing AI legal research tools for incorrect interpretations of case law
Software engineers finding bugs in AI code generation models
Academics documenting domain-specific weaknesses in large language models
Industry specialists gathering evidence of AI limitations before deployment decisions
Pricing
Premium
Pricing not publicly specified
Likely includes additional testing capacity, faster verification, or enhanced analytics