Making AI trustworthy: Can we overcome black-box hallucinations?

2:30 AM PDT • August 24, 2023

Square Black Box Mockup on dark background. 3d rendering — **Image Credits:** Customdesigner (opens in a new window) / Getty Images

Mike Capps

Contributor

Dr. Mike Capps is CEO and co-founder of ethical AI startup Diveplane and former president of Epic Games.

Like most engineers, as a kid I could answer elementary school math problems by just filling in the answers.

But when I didn’t “show my work,” my teachers would dock points; the right answer wasn’t worth much without an explanation. Yet, those lofty standards for explainability in long division somehow don’t seem to apply to AI systems, even those making crucial, life-impacting decisions.

The major AI players that fill today’s headlines and feed stock market frenzies — OpenAI, Google, Microsoft — operate their platforms on black-box models. A query goes in one side and an answer spits out the other side, but we have no idea what data or reasoning the AI used to provide that answer.

Most of these black-box AI platforms are built on a decades-old technology framework called a “neural network.” These AI models are abstract representations of the vast amounts of data on which they are trained; they are not directly connected to training data. Thus, black-box AIs infer and extrapolate based on what they believe to be the most likely answer, not actual data.

Sometimes this complex predictive process spirals out of control and the AI “hallucinates.” By nature, black-box AI is inherently untrustworthy because it cannot be held accountable for its actions. If you can’t see why or how the AI makes a prediction, you have no way of knowing if it used false, compromised, or biased information or algorithms to come to that conclusion.

While neural networks are incredibly powerful and here to stay, there is another under-the-radar AI framework gaining prominence: instance-based learning (IBL). And it’s everything neural networks are not. IBL is AI that users can trust, audit, and explain. IBL traces every single decision back to the training data used to reach that conclusion.

IBL can explain every decision because the AI does not generate an abstract model of the data, but instead makes decisions from the data itself. And users can audit AI built on IBL, interrogating it to find out why and how it made decisions, and then intervening to correct mistakes or bias.

This all works because IBL stores training data (“instances”) in memory and, aligned with the principles of “nearest neighbors,” makes predictions about new instances given their physical relationship to existing instances. IBL is data-centric, so individual data points can be directly compared against each other to gain insight into the dataset and the predictions. In other words, IBL “shows its work.”

The potential for such understandable AI is clear. Companies, governments, and any other regulated entities that want to deploy AI in a trustworthy, explainable, and auditable way could use IBL AI to meet regulatory and compliance standards. IBL AI will also be particularly useful for any applications where bias allegations are rampant — hiring, college admissions, legal cases, and so on.

Companies are using IBL in the wild today. My company has built a commercial IBL framework used by customers such as large financial institutions to detect anomalies across customer data and generate auditable synthetic data that complies with the EU’s General Data Protection Regulation (GDPR).

Of course, IBL is not without challenges. The main limiting factor for IBL is scalability, which was also a challenge that neural networks faced for 30 years until modern computing technology made them feasible. With IBL, each piece of data must be queried, cataloged, and stored in memory, which becomes harder as the dataset grows.

However, researchers are creating fast-query systems based on advances in information theory to significantly speed up this process. This state-of-the-art technology has enabled IBL to directly compete with the computational feasibility of neural networks.

Despite these challenges, the potential for IBL is clear. As more and more companies seek safe, explainable, and auditable AI, black-box neural networks will no longer cut it. So, if you run a company — whether a small startup or a larger enterprise — here are some practical tips to start deploying IBL today:

Adopt an agile and open mindset

With IBL, it works best to explore your data for the insights it can give you, rather than assigning it a particular task, such as “predict the optimal price” of an item. Keep an open mind and let IBL guide your learnings. IBL may tell you that it can’t predict an optimal price very well from a given dataset but can predict the times of day people make the most purchases, or how they contact your company, and what items they are most likely to buy.

IBL is an agile AI framework that requires collaborative communication between decision-makers and data science teams — not the usual “toss a question over the transom, wait for your answer” that we see in many organizations deploying AI today.

Think “less is more” for AI models

In traditional black-box AI, a single model is trained and optimized for a single task, such as classification. In a large enterprise, this might mean there are thousands of AI models to manage, which is both expensive and unwieldy. In contrast, IBL enables versatile, multitask analysis. For example, a single IBL model can be used for supervised learning, anomalies detection, and synthetic data generation, while still providing full explainability.

This means IBL users can build and maintain fewer models, enabling a leaner, more adaptable AI toolbox. So if you’re adopting IBL, you need programmers and data scientists, but you don’t need to invest in tons of PhDs with AI experience.

Mix up your AI tool set

Neural networks are great for any applications that don’t need to be explained or audited. But when AI is helping companies make big decisions, such as whether to spend millions of dollars on a new product or complete a strategic acquisition, it must be explainable. And even when AI is used to make smaller decisions, such as whether to hire a candidate or give someone a promotion, explainability is key. No one wants to hear they missed out on a promotion based on an inexplicable, black-box decision.

And companies will soon face litigation in these types of instances. Choose your AI frameworks based on the application; go with neural nets if you just want fast data ingestion and quick decision-making, and use IBL when you need trustworthy, explainable, and auditable decisions.

Instance-based learning is not a new technology. Over the last two decades, computer scientists have developed IBL in parallel with neural networks, but IBL has received less public attention. Now IBL is gaining new notice amid today’s AI arms race. IBL has proven it can scale while maintaining explainability — a welcome alternative to hallucinating neural nets that spew out false and unverifiable information.

With so many companies blindly adopting neural network–based AI, the next year will undoubtedly see many data leaks and lawsuits over bias and misinformation claims.

Once the mistakes made by black-box AI begin hitting companies’ reputations — and bottom lines! — I expect that slow-and-steady IBL will have its moment in the sun. We all learned the importance of “showing our work” in elementary school, and we can certainly demand that same rigor from AI that decides the paths of our lives.

More TechCrunch

A US Trustee wants troubled fintech Synapse to be liquidated via Chapter 7 bankruptcy, cites ‘gross mismanagement’

Mary Ann Azevedo

11 hours ago

The prospects for troubled banking-as-a-service startup Synapse have gone from bad to worse this week after a United States Trustee filed an emergency motion on Wednesday. The trustee is asking…

A US Trustee wants troubled fintech Synapse to be liquidated via Chapter 7 bankruptcy, cites ‘gross mismanagement’

Space

Seraphim’s latest space accelerator welcomes nine companies

Aria Alamalhodaei

11 hours ago

U.K.-based Seraphim Space is spinning up its 13th accelerator program, with nine participating companies working on a range of tech from propulsion to in-space manufacturing and space situational awareness. The…

Seraphim’s latest space accelerator welcomes nine companies

OpenAI inks deal to train AI on Reddit data

Kyle Wiggers

12 hours ago

OpenAI has reached a deal with Reddit to use the social news site’s data for training AI models. In a blog post on OpenAI’s press relations site, the company said…

OpenAI inks deal to train AI on Reddit data

Social

X pushes more users to Communities

Sarah Perez

14 hours ago

X users will now be able to discover posts from new Communities that are trending directly from an Explore tab within the section.

Social

Mark Zuckerberg’s makeover: Midlife crisis or carefully crafted rebrand?

Amanda Silberling

15 hours ago

For Mark Zuckerberg’s 40th birthday, his wife got him a photoshoot. Zuckerberg gives the camera a sly smile as he sits amid a carefully crafted re-creation of his childhood bedroom.…

Mark Zuckerberg’s makeover: Midlife crisis or carefully crafted rebrand?

Strava taps AI to weed out leaderboard cheats, unveils ‘family’ plan, dark mode and more

Paul Sawers

16 hours ago

Strava announced a slew of features, including AI to weed out leaderboard cheats, a new ‘family’ subscription plan, dark mode and more.

Strava taps AI to weed out leaderboard cheats, unveils ‘family’ plan, dark mode and more

Robotics

Astronauts fall over. Robotic limbs can help them back up.

Brian Heater

16 hours ago

We all fall down sometimes. Astronauts are no exception. You need to be in peak physical condition for space travel, but bulky space suits and lower gravity levels can be…

Astronauts fall over. Robotic limbs can help them back up.

Enterprise

Microsoft’s custom Cobalt chips will come to Azure next week

Frederic Lardinois

16 hours ago

Microsoft will launch its custom Cobalt 100 chips to customers as a public preview at its Build conference next week, TechCrunch has learned. In an analyst briefing ahead of Build,…

Microsoft’s custom Cobalt chips will come to Azure next week

Transportation

Tesla keeps cutting jobs and the feds probe Waymo

Kirsten Korosec

17 hours ago

What a wild week for transportation news! It was a smorgasbord of news that seemed to touch every sector and theme in transportation.

Tesla keeps cutting jobs and the feds probe Waymo

Sony Music warns tech companies over ‘unauthorized’ use of its content to train AI

Aisha Malik

17 hours ago

Sony Music Group has sent letters to more than 700 tech companies and music streaming services to warn them not to use its music to train AI without explicit permission.…

Sony Music warns tech companies over ‘unauthorized’ use of its content to train AI

GrubMarket buys Butter to give its food distribution tech an AI boost

Rita Liao

17 hours ago

Winston Chi, Butter’s founder and CEO, told TechCrunch that “most parties, including our investors and us, are making money” from the exit.

GrubMarket buys Butter to give its food distribution tech an AI boost

Fintech

Bolt founder Ryan Breslow wants to settle an investor lawsuit by returning $37 million worth of shares

Christine Hall

17 hours ago

The investor lawsuit is related to Bolt securing a $30 million personal loan to Ryan Breslow, which was later defaulted on.

Bolt founder Ryan Breslow wants to settle an investor lawsuit by returning $37 million worth of shares

Enterprise

With the end of Workplace, it’s fair to wonder if Meta was ever serious about the enterprise

Ron Miller

17 hours ago

Meta, the parent company of Facebook, launched an enterprise version of the prominent social network in 2015. It always seemed like a stretch for a company built on a consumer…

With the end of Workplace, it’s fair to wonder if Meta was ever serious about the enterprise

Social

Meta Threads is testing pinned columns on the web, similar to the old TweetDeck

Ivan Mehta

18 hours ago

X, formerly Twitter, turned TweetDeck into X Pro and pushed it behind a paywall. But there is a new column-based social media tool in town, and it’s from Instagram Threads.…

Meta Threads is testing pinned columns on the web, similar to the old TweetDeck

Apps

Google expands hands-free and eyes-free interfaces on Android

Devin Coldewey

18 hours ago

As part of 2024’s Accessibility Awareness Day, Google is showing off some updates to Android that should be useful to folks with mobility or vision impairments. Project Gameface allows gamers…

Google expands hands-free and eyes-free interfaces on Android

Security

Hacker claims theft of India’s Samco account data

Jagmeet Singh

18 hours ago

A hacker listed the data allegedly breached from Samco on a known cybercrime forum.

Hacker claims theft of India’s Samco account data

Security

Ireland privacy watchdog confirms Dell data breach investigation

Lorenzo Franceschi-Bicchierai

18 hours ago

A top European privacy watchdog is investigating following the recent breaches of Dell customers’ personal information, TechCrunch has learned. Ireland’s Data Protection Commission (DPC) deputy commissioner Graham Doyle confirmed to…

Ireland privacy watchdog confirms Dell data breach investigation

Ampere teams up with Qualcomm to launch an Arm-based AI server

Frederic Lardinois

18 hours ago

Ampere and Qualcomm aren’t the most obvious of partners. Both, after all, offer Arm-based chips for running data center servers (though Qualcomm’s largest market remains mobile). But as the two…

Ampere teams up with Qualcomm to launch an Arm-based AI server

Google I/O was an AI evolution, not a revolution

Sarah Perez

18 hours ago

At Google’s I/O developer conference, the company made its case to developers — and to some extent, consumers — why its bets on AI are ahead of rivals. At the…

Google I/O was an AI evolution, not a revolution

Meet the Magnificent Six: A tour of the stages at Disrupt 2024

TechCrunch Events

18 hours ago

TechCrunch Disrupt has always been the ultimate convergence point for all things startup and tech. In the bustling world of innovation, it serves as the “big top” tent, where entrepreneurs,…

Meet the Magnificent Six: A tour of the stages at Disrupt 2024

Startups

Khosla Ventures, Pear VC triple down on Honey Homes, a smart way to hire a handyman

Mary Ann Azevedo

19 hours ago

There’s apparently a lot of demand for an on-demand handyperson. Khosla Ventures and Pear VC have just tripled down on their investment in Honey Homes, which offers up a dedicated…

Khosla Ventures, Pear VC triple down on Honey Homes, a smart way to hire a handyman

Apps

TikTok tests 60-minute video uploads as it continues to take on YouTube

Aisha Malik

19 hours ago

TikTok is testing the ability for users to upload 60-minute videos, the company confirmed to TechCrunch on Thursday. The feature is available to a limited group of users in select…

TikTok tests 60-minute video uploads as it continues to take on YouTube

Privacy

Flock Safety’s solar-powered cameras could make surveillance more widespread

Haje Jan Kamps

19 hours ago

Flock Safety is a multibillion-dollar startup that’s got eyes everywhere. As of Wednesday, with the company’s new Solar Condor cameras, those eyes are solar-powered and use wireless 5G networks to…

Flock Safety’s solar-powered cameras could make surveillance more widespread

Startups

Agora raises $34M Series B to keep building the Carta for real estate

Marina Temkin

20 hours ago

Since he was very young, Bar Mor knew that he would inevitably do something with real estate. His family was involved in all types of real estate projects, from ground-up…

Agora raises $34M Series B to keep building the Carta for real estate

Commerce

Poshmark’s ‘Promoted Closet’ tool lets sellers boost all their listings at once

Lauren Forristal

21 hours ago

Poshmark, the social commerce site that lets people buy and sell new and used items to each other, launched a paid marketing tool on Thursday, giving sellers the ability to…

Poshmark’s ‘Promoted Closet’ tool lets sellers boost all their listings at once

Google adds Gemini to its Education suite

Ivan Mehta

21 hours ago

Google is launching a Gemini add-on for educational institutes through Google Workspace.

Google adds Gemini to its Education suite

YC-backed Recall.ai gets $10M Series A to help companies use virtual meeting data

Kate Park

21 hours ago

More money for the generative AI boom: Y Combinator-backed developer infrastructure startup Recall.ai announced Thursday it has raised a $10 million Series A funding round, bringing its total raised to over…

YC-backed Recall.ai gets $10M Series A to help companies use virtual meeting data

Enterprise

CoLab’s collaborative tools for engineers line up $21M in new funding

Kyle Wiggers

21 hours ago

Engineers Adam Keating and Jeremy Andrews were tired of using spreadsheets and screenshots to collab with teammates — so they launched a startup, CoLab, to build a better way. The…

CoLab’s collaborative tools for engineers line up $21M in new funding

Apps

Reddit reintroduces its awards system

Ivan Mehta

21 hours ago

Reddit announced on Wednesday that it is reintroducing its awards system after shutting down the program last year. The company said that most of the mechanisms related to awards will…

Enterprise

Sigma is building a suite of collaborative data analytics tools

Kyle Wiggers

21 hours ago

Sigma Computing, a startup building a range of data analytics and business intelligence tools, has raised $200 million in a fresh VC round.

Making AI trustworthy: Can we overcome black-box hallucinations?

Mike Capps

Adopt an agile and open mindset

Think “less is more” for AI models

Mix up your AI tool set

More TechCrunch

Get the industry’s biggest tech news

TechCrunch Daily News

Startups Weekly

TechCrunch Fintech

TechCrunch Mobility

Tags