AAWEA.ORG

Leaked AI Benchmark Report Photo

πŸ“ Prompt Template
{
  "type": "photograph of a computer monitor displaying an academic technical report",
  "style": "slightly angled screen photo, visible moire pattern, LCD pixel grid, slight glare, LaTeX document formatting, serif fonts",
  "document_header": {
    "left": "4 Benchmark Evaluation",
    "right": "{argument name=\"report title\" default=\"DeepSeek-V4 Technical Report\"}"
  },
  "introductory_text": "Paragraph summarizing comprehensive evaluation of {argument name=\"main model name\" default=\"DeepSeek-V4\"} against {argument name=\"competitor model 1\" default=\"GPT-5.3\"}, {argument name=\"competitor model 2\" default=\"Claude Opus 4.6\"}, and {argument name=\"competitor model 3\" default=\"Gemini 3.1 Pro Preview\"}.",
  "visualizations": {
    "legend": "5 items with color codes: dark blue, grey, light grey, blue striped, light blue",
    "bar_charts": {
      "count": 6,
      "labels": [
        "MMLU-Pro (EM)",
        "GPQA-Diamond (Pass@1)",
        "AIME 2025 (Pass@1)",
        "LiveCodeBench (Pass@1-COT)",
        "SWE-bench Verified (Resolved)",
        "Tau-bench (Average)"
      ]
    },
    "caption": "Figure 1 | Performance comparison on core benchmarks. {argument name=\"main model name\" default=\"DeepSeek-V4\"} achieves state-of-the-art results across the majority of benchmarks."
  },
  "data_table": {
    "columns": [
      "Benchmark",
      "{argument name=\"main model name\" default=\"DeepSeek-V4\"}",
      "{argument name=\"competitor model 1\" default=\"GPT-5.3\"}",
      "{argument name=\"competitor model 2\" default=\"Claude Opus 4.6\"}",
      "{argument name=\"competitor model 3\" default=\"Gemini 3.1 Pro Preview\"}",
      "GPT-4.1"
    ],
    "categories": {
      "count": 4,
      "rows": [
        {"label": "General", "icon": "globe/network", "sub_items": 3},
        {"label": "Reasoning & Math", "icon": "calculator/clipboard", "sub_items": 3},
        {"label": "Code", "icon": "code brackets", "sub_items": 3},
        {"label": "Agent", "icon": "robot face", "sub_items": 3}
      ]
    }
  }
}
About This Prompt

Generates a realistic photograph of a computer screen displaying an academic technical report with bar charts and a detailed performance table.
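
The template marks its variable parts with `{argument name=\"...\" default=\"...\"}` placeholders. How the hosting site substitutes them is not documented here, so the following is only a minimal Python sketch of one way to fill them locally. The function name `fill_template` and the sample override value are hypothetical, and the sketch assumes the template is handled as raw text with JSON-escaped quotes, exactly as shown above.

```python
import json
import re
from typing import Optional

# Matches the placeholder form used in the raw template text, where the inner
# quotes are JSON-escaped: {argument name=\"...\" default=\"...\"}
PLACEHOLDER = re.compile(r'\{argument name=\\"([^"\\]+)\\" default=\\"([^"\\]*)\\"\}')


def fill_template(raw_template: str, values: Optional[dict] = None) -> str:
    """Replace each placeholder with a supplied value, or with its default."""
    values = values or {}

    def substitute(match):
        name, default = match.group(1), match.group(2)
        chosen = values.get(name, default)
        # json.dumps escapes quotes and backslashes; the outer quotes are
        # stripped because the placeholder already sits inside a JSON string.
        return json.dumps(chosen, ensure_ascii=False)[1:-1]

    return PLACEHOLDER.sub(substitute, raw_template)


# A short excerpt in the same shape as the template above, kept as raw text.
excerpt = r'''{
  "document_header": {
    "left": "4 Benchmark Evaluation",
    "right": "{argument name=\"report title\" default=\"DeepSeek-V4 Technical Report\"}"
  },
  "columns": ["Benchmark", "{argument name=\"main model name\" default=\"DeepSeek-V4\"}"]
}'''

# "My-Model" is an illustrative override, not a value from the original page.
filled = json.loads(fill_template(excerpt, {"main model name": "My-Model"}))
print(filled["document_header"]["right"])  # no override given, so the default is used
print(filled["columns"][1])                # override applied
```

Parsing the result with `json.loads` doubles as a check that the filled template is still valid JSON. Because the same argument name appears in several fields, a single entry in `values` updates every occurrence consistently.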

Anneshu Nag
@anneshunag
Metadata
Published Jun 12, 2026
Model
GPT Image 2 10 cr/run
Category
Statistics