My experience with Gemini models is that in agent mode, they frequently fail to apply the changes they say they have made.

Then you have to tell it that it forgot to apply the changes, and it will apologize and apply them.

The other thing I notice is that it is shallow compared to Claude Sonnet.

For example, I gave an identical prompt to Claude Sonnet and Gemini.

The prompt was to explore the codebase, taking as much time as needed, with the end goal of writing an LLM.md file that explains the codebase to an LLM agent to get it up to speed.

Gemini did it in a single shot, generating a file that was mostly cliché-ridden and generic.

Claude asked 8 to 10 questions in response, each of which was surprising, and the documentation it generated was amazing.