Abstract: We performed a comparative analysis of code generation model performance with evaluation using common NLP metrics in comparison to a test-based evaluation. The investigation was performed in ...
The International Mathematical Olympiad (IMO) is a prestigious competition featuring talented high school students from around the world, in which competitors solve complicated mathematical problems.