2 Comments
David Karnok

Will you try other LLM flavors, such as ChatGPT/Gemini/GabAI, in a similar fashion?

Vox Day

Not in this context. There isn't any point, given the limitations of the various models. Gemini is an overenthusiastic cheerleader, and ChatGPT invents nonexistent flaws. Both are unreliable. Grok has serious limitations too, but they can be managed.