OpenAI unveils CriticGPT, a model that is built to critique GPT4
OpenAI recently announced a new model called CriticGPT based on GPT4. Unlike the other models from the company, which are consumer-facing, CriticGPT is designed to “write critiques of ChatGPT responses to help human trainers spot mistakes during reinforcement learning from human feedback (RLHF).”
CriticGPT is based on GPT-4 and will help the human trainers at OpenAI to “catch errors in ChatGPT’s code output.” According to OpenAI, code reviewed by CriticGPT can outperform unreviewed code by 60 per cent. The company is currently integrating CriticGPT-like models into the RLHF labelling pipeline to assist AI trainers in evaluating outputs from advanced AI systems.