Skip to content

Successful IMO 2025 Runs + Suggestions and a Challenge Problem #5

@xx205

Description

@xx205

Excellent work! Last week, I successfully used the code in this repo to reproduce the solutions for IMO 2025 p1 and p2 (using the version with extra prompts). However, I have a few suggestions:

  1. It would be helpful to allow setting a minimum number of successful agents, so the process can stop once that threshold is reached. When I ran p1, all 10 agents kept running until each one found its own solution. This seems unnecessary, considering that each correct solution is verified 5 times. For my own runs, I set the minimum number of successful agents to 2.
  2. The code could be modified to run with an OpenAI-compatible API, which Gemini natively supports.
  3. Requests could be made in streaming mode to avoid timeout errors, which I frequently encountered during my runs.

(I have already implemented the above changes in my version.)

Furthermore, I've found that while Gemini 2.5 Pro can successfully solve 5 of the IMO 2025 math problems using the code in this repo, it seems unable to fully solve the final question from the 2024 Chinese National College Entrance Examination (Gaokao) math paper (New Curriculum Standard, Type I). The problem is as follows:

*** Problem Statement ***

Let $m$ be a positive integer. Let the sequence $a_1, a_2, \ldots, a_{4m+2}$ be an arithmetic progression with a non-zero common difference. If after removing two terms $a_i$ and $a_j$ (where $i < j$), the remaining $4m$ terms can be partitioned into $m$ groups, with each group consisting of 4 numbers that form an arithmetic progression, then the sequence $a_1, a_2, \ldots, a_{4m+2}$ is called an \textbf{(i, j)-divisible sequence}.

Two distinct numbers $i$ and $j$ are chosen randomly from the set $\{1, 2, \ldots, 4m+2\}$, with $i < j$. Let $P_m$ be the probability that the sequence $a_1, a_2, \ldots, a_{4m+2}$ is an $(i, j)$-divisible sequence. Prove that $P_m > \frac{1}{8}$.

As a side note, without tool assistance (specifically, without using a web search to find a blog with the solution), I have not yet found any publicly available AI that can solve this problem completely, including o3, o3-pro, Gemini 2.5 Pro, and Grok 4.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions