In the challenge, VERSES compared the DeepSeek-R1 model to Genius. Each model attempted to crack the Mastermind code in 100 games, with up to ten guesses per game. Each model was given a hint for each guess ...