Does o3 doing increasingly well with problems designed to be difficult actually mean it's getting closer to being an innovative genius?