Goldman Sachs data chief says AI facing ‘training data shortage’: ‘I think the real interesting thing is going to be…’
The artificial intelligence (AI) industry is confronting a critical shortage of high-quality training data, a constraint that may already be shaping the next generation of AI systems, Neema Raphael, Goldman Sachs’ chief data officer and head of data engineering, has said.
“We’ve already run out of data,” Raphael stated, noting that this deficit is forcing companies to increasingly rely on synthetic data—machine-generated text, images, and code. Raphael made the assertion on the bank’s “Exchanges” podcast, confirming a growing industry suspicion that the readily available data on the open web has been exhausted.
