TheAgentCompany (CMU Research)
https://github.com/TheAgentCompany/TheAgentCompanyCarnegie Mellon University's benchmark simulation of a company staffed entirely by AI agents — revealing that the best models complete only 24% of real business tasks.
Carnegie Mellon University's benchmark simulation of a company staffed entirely by AI agents — revealing that the best models complete only 24% of real business tasks.