Skip to content

Commit c39c1cf

Browse files
Add description of CePO method
1 parent cbebb2c commit c39c1cf

File tree

1 file changed

+25
-0
lines changed

1 file changed

+25
-0
lines changed

README.md

Lines changed: 25 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,30 @@
11
# Cerebras Planning and Optimization (CePO)
22

3+
CePO is an inference-time computation method designed to enhance the accuracy of large language models (LLMs) on tasks requiring reasoning and planning, such as solving math or coding problems. It integrates several advanced techniques, including Best of N, Chain of Thought (CoT), Self-Reflection, Self-Improvement, and Prompt Engineering.
4+
5+
## Methodology
6+
7+
In CePO, the Best of N technique is applied to `bestoofn_n` solution candidates. Each solution is generated through the following four steps:
8+
9+
**Step 1**: Plan Generation
10+
The model generates a detailed, step-by-step plan to solve the problem, along with its confidence level for each step.
11+
12+
**Step 2**: Initial Solution
13+
Using the plan from Step 1, the model produces an initial solution.
14+
15+
Steps 1 and 2 are repeated `planning_n` times to generate multiple solution proposals.
16+
If the model exceeds the token budget during Step 1 or 2, the plan/solution is marked as incomplete, rejected, and regenerated. A maximum of `planning_m` attempts is made to generate `planning_n` valid proposals.
17+
18+
**Step 3**: Plan Refinement
19+
The model reviews all generated solution proposals and their associated plans, identifying inconsistencies. Based on this analysis, a refined, final step-by-step plan is constructed.
20+
21+
**Step 4**: Final Solution
22+
The model uses the refined plan from Step 3 to produce the final answer.
23+
24+
## Current Status
25+
26+
This project is a work in progress, and the provided code is in an early experimental stage. While promising, the method may require further development and testing to optimize its performance across different tasks.
27+
328
## Results
429

530
### Comparison of CePO with default settings and base model

0 commit comments

Comments
 (0)