A new open-source AI model named Devin has shown promising results in software engineering benchmarks. Devin achieved 12.29% accuracy on 100% of the SWE Bench test set, compared to 13.84% on 25% of the set. The model uses GPT4 and is expected to improve with GPT5. The project is led by a team including John and has garnered attention for its potential in generalization.
Open source with results close to Devin #ai #coding https://t.co/eNxX1dk8FO
Exciting open source Devin from Princeton https://t.co/2eiigNO0Pn
Open source Devin, with very impressive numbers. From the thread: 'letting SWE-agent only view 100 lines at a time was better than letting it view 200 or 300 lines and much better than letting it view the entire file'. https://t.co/mQsplpewWw