The Double Edged Sword
Audio Narration0:00/410.3053061224491×
A key measure of AI progress is the length of time a model can work on so called ‘long horizon tasks’ without going off the rails. Most high value real world jobs involve some ‘long horizon tasks’, i.e. tasks that take hours rather than