Bing's ChatGPT experiment is deeply flawed, and is the future of search

AppleInsider · February 14, 2023 5:34PM

Microsoft's addition of a ChatGPT interface to Bing could revolutionize search at some point, but given the nonsense it can spew now, it looks like it will take years before it becomes actually useful.

The Bing logo with a background of chat results.

For months, OpenAI's ChatGPT has been in the spotlight among tech reporters. As a chatbot that uses information scraped from the Internet to create plausible-sounding answers to users' questions, it has a lot of potential.

Given a considerable cash infusion from Microsoft and a long working relationship between the two companies, it's no surprise that Redmond wants to infuse its products with these smarts in various ways.

The addition of the technology to its Office productivity suite is still on the way, but Microsoft is also betting that Bing, its search engine, could benefit from the same thing.

With Google working on its own chatbot, called "Bard," Microsoft is clearly not alone in thinking about enhancing search. Hooking up ChatGPT to a search engine should be a successful combination, in theory.

In reality, there's still a lot of work to be done. Don't underestimate what we're saying here -- it needs a ridiculous amount of work, not just technically, but on the foundation, as well.

What is the new Bing?

Titled "the new Bing" by Microsoft, Bing's Chat aims to provide direct answers to questions.

A normal search engine query relies on users entering keywords to get back a list of relevant results, offered in the form of links to web pages.

AI chat systems like ChatGPT, as used by Microsoft in Bing, instead attempt to give an actual answer to a query on the page itself. Rather than leaving you to wade through many links to find the right answer to your search, the bot will craft an understandable answer to the question.

Is a hotdog a sandwich or a taco? Bing tries to answer that.

Much like a human answering a question, a bot will draw on its compiled resources, namely data scraped from the Internet, and create a straightforward response in one go.

Since the question can be a complete sentence with qualifying elements to fine-tune the response itself, this processing can also lean into the creative side and can give the appearance of the bot "thinking" of an answer just for you.

In theory, if you want a recipe for fluffy pancakes, the ideal AI chatbot would use its knowledge of pancake recipes to determine the set of ingredients and how much of each to use, as well as the method of cooking.

A search engine may give you a link to a known recipe. An AI chatbot has the potential to come up with its own on the fly that has never been made before.

That's both good, and terrible, simultaneously.

Jumping through hoops

For the initial introduction of the Bing enhancement, Microsoft didn't open it up to all users. You had to join a waitlist before you'd be allowed to try out the new service.

Of course, if you played ball with Microsoft, you could get ahead of the rest of the field. If you agreed to set Microsoft's services as the default for all of the options and installed Bing, you'd be placed further ahead in the list.

Eventually, an email comes through saying you've made it through the list and can therefore use the chat option. Except it's only available under limited circumstances.

You have to get through a waitlist first.

For a start, you must be using Microsoft Edge. You could be logged into Bing via a different browser, but the chat system will only work for Edge users.

You can't do it on mobile either, but Microsoft says it is working on that.

In the future, you will be able to use the AI chat system on any browser, but no one outside of Microsoft knows when that will be.

Accessing the extra options

After you've got Edge installed on the Mac and you've signed into Bing, accessing the chatbot isn't that hard.

The main way to the full-fat experience is to click on the Chat link at the top of the page. This brings you to a full-screen page, offering the kind of things you can ask of the tool, and a simple search box stating "Ask me anything."

On entering a query in the box, the page will process for a bit, before bringing up a response that could be multiple paragraphs long.

An example interaction with Bing's Chat feature.

Throughout the response, as well as below, you'll find a list of references that the bot is leaning on for some of its result, if applicable. Clicking on these will bring up more information, and also take you to the source of that data point.

While the response is being output on the screen, you are also presented with a stop button, which will interrupt the chat bot's flow and allow you to change your query. After the stoppage or if it's finished, you can also ask follow-up questions using the text box, or select one of the suggested query options.

A broom icon button can be clicked for a "new topic," so your queries are treated as fresh and not following on from earlier questions.

You can also get the AI results beside normal search queries on Bing.

You can also get results from the chat service through regular search queries. A box to the side of results will show the same chatbot creating its response, effectively summarizing the swathe of links right next to it.

Getting results, or not

The results that come up from the chatbot can be straightforward, but also unexpected.

Take the example of fluffy pancake recipes. Bing responds with a short list of instructions, complete with the weights and measures for ingredients that can plausibly be used in cooking the pancakes.

However, while this instruction list is something you'd expect to be copied and pasted from an established recipe, it's actually been sensibly embellished.

The fluffiest pancakes, but ad-libbing to an existing set of instructions.

The list of instructions, as linked by Bing, stems from a site called "Taste of Home," with the relevant details highlighted on the page. But, while the source doesn't include measurements in the directions, Bing inserts its own into that text.

Furthermore, there is also a difference of opinion on how much each of those measurements should be.

Bing suggests two tablespoons of sugar where the original recipe calls for one, and recommends only a quarter teaspoon of salt against the half teaspoon used in the recipe. The quarter-cup of shortening is switched out for two tablespoons, too.

Straightforward queries involving people, places, and things can also be quite well answered. Asking about AppleInsider editorial member William Gallagher comes up with a result that assumes we're talking about the "British writer and journalist," and his various accomplishments.

Who is William Gallagher?

Asking about your author here, Malcolm Owen, is less useful. For a start, it confuses the AppleInsider Malcolm Owen with the departed singer from the punk band The Ruts.

Within two sentences, it mentions he died in 1980 from a heroin overdose, then adds he was "also a photographer, a technology writer and a Twitch streamer."

Bing thinks the author of this piece is dead. Or there's a zombie musician streaming on Twitch...

These latter facts are true about the living person of that name, but there isn't a distinction between the two. And, it should be obvious that the two have been conflated, given Twitch didn't launch until more than three decades after the musician's death.

A conversation about a runny nose eventually ends up with a list of home remedies that could be tried. Given the potential inaccuracies of such chat systems, the result was agreeable, but with the addition that calling a doctor may be a good idea in extreme circumstances.

Getting creative

Since there's a level of creative freedom at play, you can expect the AI to try to follow rules and conventions it picked up in its web-scraping to create an ideal response.

Depending on how forgiving you are, this is a mixed bag of results.

For a start, we asked how Tim Cook would introduce the Star Trek Transporter at an Apple event, but the bot wanted to respect Cook's position as a CEO of influence and his privacy.

What if Apple did the Star Trek Transporter at an Apple Event?

Pressed to continue without mentioning Cook, the bot still has a stab, and does so fairly well. It followed the tropes of Apple launches, mentioning compatibility with devices, safety, privacy, and encryption, and even slipping in "think different."

Admittedly the Apple Transporter would cost a lot more than $9,999, and an unexpected code-like string in the middle is a misstep, but it was a good stretch of creative text.

Turning to Star Trek alone, the bot also creates a fairly believable but fairly wooden conversation between Data and Spock.

Asked to parody the song American Pie with Star Wars, the bot refuses to acknowledge the existence of the Weird Al version. And while its attempt doesn't go into as much detail about the films, it does throw in enough references to make it definitively a Star Wars parody.

Yes, you can even use it to produce code. Accuracy may vary though.

You can even get Bing to complete tasks for you in some cases. When asked to make a Python script for counting from 1 to 10, it offered two code blocks, one using a while loop and the other using recursion.
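For reference, here is a minimal sketch of what those two approaches might look like. This is not Bing's actual output, which isn't reproduced here; the function names and the choice to return lists are our own assumptions for illustration.

```python
def count_with_while(limit=10):
    """Collect the numbers 1..limit using a while loop."""
    numbers = []
    current = 1
    while current <= limit:
        numbers.append(current)
        current += 1
    return numbers


def count_with_recursion(current=1, limit=10):
    """Collect the numbers current..limit using recursion."""
    # Base case: once we pass the limit, there is nothing left to count.
    if current > limit:
        return []
    # Recursive case: this number, followed by the rest of the count.
    return [current] + count_with_recursion(current + 1, limit)


if __name__ == "__main__":
    # Hypothetical usage: print each count, one number per line.
    for number in count_with_while():
        print(number)
    for number in count_with_recursion():
        print(number)
```

Both functions produce the same sequence. The loop is the more conventional choice in Python, while the recursive version mostly serves to show the same idea expressed differently.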

Getting paid?

One elephant in the room is the problem of attribution. Or more specifically, the original creators of works being properly credited or paid for the responses that this system creates.

A search engine's normal results only provide small snippets from a page and expect users to click through to a site to see the actual result, where they also see the advertising on that page that helps pay the creator's mortgage.

With chatbots creating a reasonably accurate (in most cases) response, there's little actual need for the user to go any further in their search for information since all of the relevant details have been presented to them.

Sure, Bing includes citations to sources, but there's no incentive for the user or Microsoft to do anything with those links.

A made-up conversation between Spock and Data, with no citations.

It's not hard to draw a straight line here. Right now, a sizable portion of internet users will stop at headlines and a very brief description of what's behind the link and not go any deeper.

This cuts revenues for publications, trimming down staff, who will then, in turn, generate worse content for the AIs to scrape.

And then, venues like CNET have already turned to AI to generate texts, which are bad or put forth false information. We tested -- those are already part of Bing's new search.

By reducing the funding for content by not sending visitors that way, and in turn reducing the quality of content being produced, that means the content AIs ingest will also be poor quality. When the vicious circle completes and the AI chatbot inevitably shares that bad information, the chatbot looks worse.

Even if quality controls are improved to ensure the chatbot is actually correct and not using false data, the damage will already have been done.

And, cases such as the Star Trek conversation test or the parody song don't often provide links to further study. In those situations, there's no crediting at all for where the chatbot gained its knowledge, and no chance that those who created the data points the chatbot used will get recognition or compensation at this time.

It's a difficult situation to work through, but attribution and compensation are areas that need to be addressed as AI becomes more of a thing. This shouldn't be optional, but given how the nations of the world work, we're pretty sure it will be left to big tech to self-regulate.

We're also pretty sure how that will go.

The future is still in the future

Industries evolve whenever they see a massive change in technology that can revolutionize how things work. It's been a long time coming, but AI is just reaching that tipping point.

How long it takes to get past that tipping point is another matter entirely.

Flattery gets you everywhere. Even if you're an AI.

In the case of the Bing-ChatGPT mash-up, there is a lot of potential for this to go far. Providing answers to a question without needing to offer a follow-up is a holy grail for search companies.

At this early stage, the Bing chat system hasn't met the mark, but it might in the future. With improvements to how it creates results, an upgrade in accuracy, and solving the citation, crediting, and compensation issues, it could go far.

With the added competition of Google's Bard, there can be a further driving force to improve.

It's an AI search arms race, certainly, but for the moment everyone's using empty water pistols.


daalseth · February 14, 2023 5:50PM

Much potential, but yes there is a LONG way to go before you can trust the results. Part of the problem of course, is that people tend to ask ambiguous questions. What is the square root of 64 is fine. What is the best electric car, will likely produce unreliable results. I read a piece on this a while back where they started asking AIs questions with no correct answer, How to catch a bigfoot? Who invented nuclear fusion in the 1800s? That sort of thing. The results were pure trash, but they had citations and everything. There’s a lot of refinement that needs to be done before we can trust the results.

jimh2 · February 14, 2023 6:44PM

No one uses Bing with or without ChatGPT. The name alone prevents me from using it. I’ll stick with Google.

eightzero · February 14, 2023 7:45PM

jimh2 said:

No one uses Bing with or without ChatGPT. The name alone prevents me from using it. I’ll stick with Google.

Yeah, sorta TLDR, but I'm not gonna google anything on Bing. I prefer to google things on duck duck go, as it seems a tad less invasive than when I google things on the google.

bulk001 · February 14, 2023 8:20PM

This article makes about as much sense as looking at the Newton and saying Apple is deeply flawed and will never amount to much. If you spend 5 minutes with ChatGPT the potential is obvious. And it is just starting.

lkrupp · February 14, 2023 8:26PM

Yeah, it’s like Safari on the Mac. The Mac technorati wannabes all use different browsers because... It’s like cellular providers. The other guy’s network and prices suck because... It’s like search engines, it’s like map apps, it’s like music services, it’s like video streaming services. The most popular one is the one to be avoided precisely because it’s the most popular one. I get it.

danox · February 14, 2023 8:37PM

bulk001 said:

This article makes about as much sense as looking at the Newton and saying Apple is deeply flawed and will never amount to much. If you spend 5 minutes with ChatGPT the potential is obvious. And it is just starting.

Like Siri? AI as imagined in TV and movies is many, many years away...

edited February 2023

mike wuerthele · February 14, 2023 8:59PM

bulk001 said:

This article makes about as much sense as looking at the Newton and saying Apple is deeply flawed and will never amount to much. If you spend 5 minutes with ChatGPT the potential is obvious. And it is just starting.

This is addressed in the article.

jdw · February 14, 2023 9:13PM

"The future is still the future" is the story of SIRI's life. When SIRI first came out, it was new, so people such as myself gave it some slack. And at the time, we all recognized its limitations but hoped and prayed for a better future. Fast forward 12 years to today. SIRI is hardly better today than it was originally back in 2011! I ask it on my iPhone to do the most basic things, but in almost every case, it tells me it can't do that. I'm talking about turning on or off super basic functions of my phone. Sorry... Can't do that. And then when it comes to getting information, SIRI is pretty much brain dead. In some ways, Apple has deliberately hobbled it, perhaps for "security" reasons, which to me as a user is really stupid. SIRI is stupid.

So when I read all this talk about AI and Bing, I can only yawn when it comes to the part that shows it to be seriously flawed in certain areas and then the story turns "to the future" and how much better IT COULD BE. Yeah right. SIRI "COULD BE" so much better too, but it isn't. "Yes, but ChatGPT is quite different from SIRI!" you say? Ha! Let's travel in our time machine to a decade from now and see who's right. A version of the flawed functionality we have today is probably all we will get.

Disagree with me and work for Apple? Great! PROVE ME WRONG by making SIRI vastly better. I dare you!

cpsro · February 14, 2023 9:34PM

I've been mildly appalled at the lack of accuracy of ChatGPT but what took the cake was when I asked it to write a small program that uses a particular system call and it claimed to have done so but what it wrote didn't use the system call.

m68000 · February 14, 2023 11:04PM

jimh2 said:

No one uses Bing with or without ChatGPT. The name alone prevents me from using it. I’ll stick with Google.

I’m somebody and I have used Bing for years and prefer it.

9secondkox2 · February 14, 2023 11:54PM

What’s really concerning is the fact that this “AI” is just stealing other people’s work.

Where does it source the info? Google? Its own search and skim engine?

Why does it not always attribute credit and provide links to the source material?

Same thing with the so-called graphic and web stuff. Where do they get their textures, reference materials, etc.?

This isn’t really AI. Yet.

It’s just mashups of text and imagery and code.

Waiting on the lawsuits…

tundraboy · February 14, 2023 11:55PM

DAalseth said:

Much potential, but yes there is a LONG way to go before you can trust the results. Part of the problem of course, is that people tend to ask ambiguous questions. What is the square root of 64 is fine. What is the best electric car, will likely produce unreliable results. I read a piece on this a while back where they started asking AIs questions with no correct answer, How to catch a bigfoot? Who invented nuclear fusion in the 1800s? That sort of thing. The results were pure trash, but they had citations and everything. There’s a lot of refinement that needs to be done before we can trust the results.

When we humans engage in conversation, we are constantly trying to size up what the other person's thoughts are, especially their intent in talking with us. Where are they coming from, so to speak. What could possibly interest them and what would not. And so on. In short we are trying to peer into the other person's mind in the effort to make communication more effective. Psychologists call that "Theory of Mind" and we engage our full cognitive toolkit, both logical and emotional, using visual and auditory cues, as well as situational and contextual awareness, to form an accurate theory of mind.

AI doesn't have theory of mind. It doesn't even have its own mind, which is the main prerequisite for forming a theory of another person's mind. That AI is stupid, literally mindless, is no surprise. The simulation of intelligence, no matter how authentic it looks, is not intelligence.

edited February 2023

radarthekat · February 15, 2023 12:38AM

9secondkox2 said:

What’s really concerning is the fact that this “AI” is just stealing other people’s work.

Where does it source the info? Google? Its own search and skim engine?

Why does it not always attribute credit and provide links to the source material?

Same thing with the so-called graphic and web stuff. Where do they get their textures, reference materials, etc.?

This isn’t really AI. Yet.

It’s just mashups of text and imagery and code.

Waiting on the lawsuits…

Interesting... when I give answers to questions I rarely cite my source material. I just answer the question. Unless the question is, "can you cite your source on that?"

waveparticle · February 15, 2023 4:11AM

Last week the Magic Mouse 2 that came with the iMac died. I googled how to replace the Magic Mouse 2 battery. I did not get an answer that solved my problem. Eventually I had to call Apple support, who helped me solve the problem.

9secondkox2 · February 15, 2023 6:02AM

radarthekat said:

9secondkox2 said:

What’s really concerning is the fact that this “AI” is just stealing other people’s work.

Where does it source the info? Google? Its own search and skim engine?

Why does it not always attribute credit and provide links to the source material?

Same thing with the so-called graphic and web stuff. Where do they get their textures, reference materials, etc.?

This isn’t really AI. Yet.

It’s just mashups of text and imagery and code.

Waiting on the lawsuits…

Interesting... when I give answers to questions I rarely cite my source material. I just answer the question. Unless the question is, "can you cite your source on that?"

The difference would be that you aren’t a tool? And you aren’t charging money?

Google is a tool. You automatically get the source because Google links to it, even when Google puts up a summary. If you consult a book, you get citations. Even presentations include a bibliography, if they want to avoid plagiarism. Conversations between humans are different. It’s a conversation.

And though internet tools like AI mashup machines pretend to be in conversation, they are still tools, publishing information to your screen.

Likewise, paintings, graphics, and code that these tools steal are not free rein material. They were created by people’s hard work and other resources. Such use is fine for personal entertainment, but not for commercial/professional use.

Also, in a personal conversation, you’re allowed to be wrong. In a professional conversation, your answer can cost you your credibility, your client, and even your job.

Microsoft adding this to their tools may prove problematic. It’s irresponsible for such a big company to provide not only tools that will get many things wrong, but that amount to plagiarism fueled mashups.

You can say whatever you want, with right or wrong answers and communicating knowledge gained from a variety of sources without attribution. But when you publish your words, it’s a different story. Likewise, it’s a different story when you are a consultant with people looking to you for answers.

These are not people, they are query based tools built by people that publish information for profit in a variety of ways, from website plugins to Microsoft money, to paid online accounts, etc.

If it’s completely free and for personal entertainment, again, great. But if you pay for it and it’s supposed to be published fact, not a good scenario.

georgie01 · February 15, 2023 1:44PM

It’s also programmed to respond in a certain way—the results are not always unbiased (and sometimes reach a shamefully biased level). For instance, given some of the widely published idiocy, it’s not far fetched to think it would try and choose a culturally ‘diverse’ cake recipe option instead of just the best option (whatever best means), because the programmers behind the scenes want to impose their perception of diversity over cake quality.

While I think this stuff is cool, along with the cool will come bad and when the primary function of AI will be to think or do for us, I’m not sure the end result will ever be good. We assume we will continue to be able to think and assess, but most people can barely do that now. How much less when they become even more used to being told?

edited February 2023

dewme · February 15, 2023 1:56PM

I think we’re all forgetting that AI stands for artificial intelligence.

We’re starting to make some excellent progress on the ‘A’ part. The ‘I’ part is much more elusive, and likely unobtainable. Honestly, we should refer to ‘AI’ as ‘Ai’, i.e., big ‘A’ and little ‘i’ until we start to get a handle on what intelligence really means.

We haven’t even emerged from the primordial ooze when it comes to establishing anything resembling human intelligence in a machine. Like Tundraboy said, a simulation of intelligence within a very narrowly defined and constrained experiment is not intelligence. A dog that’s learned to roll over on command is infinitely more intelligent than any Ai yet developed. But at least ChatGPT won’t try to hump your leg.

edited February 2023

lowededwookie · February 15, 2023 3:50PM

I’ve been waiting forever for this to happen. I loathe search engines as they are because they are keyword based. This is not how humans seek information.

Sure, this AI has a way to go but it will get better quickly now that M$ and to a lesser extent Google are onboard.

Google, however, will not be going full bore into this development. They’ll give the impression that they are but ultimately it’s too much of a risk to their business model for them to take it seriously. M$ and Apple have a vested interest in this route. The difference is that Apple is doing it on device not on browser.

Those complaining about Siri aren’t seeing the forest for the trees. Apple is working behind the scenes to get the technology there before it’s fully unleashed on the world. They’re giving us snippets of what Siri is capable of. I’d say in 5 years Siri will outperform all other AI systems because the backends will be far more powerful.

dewme · February 15, 2023 4:51PM

lowededwookie said:

I’ve been waiting forever for this to happen. I loathe search engines as they are because they are keyword based. This is not how humans seek information.

Sure, this AI has a way to go but it will get better quickly now that M$ and to a lesser extent Google are onboard.

Google, however, will not be going full bore into this development. They’ll give the impression that they are but ultimately it’s too much of a risk to their business model for them to take it seriously. M$ and Apple have a vested interest in this route. The difference is that Apple is doing it on device not on browser.

Those complaining about Siri aren’t seeing the forest for the trees. Apple is working behind the scenes to get the technology there before it’s fully unleashed on the world. They’re giving us snippets of what Siri is capable of. I’d say in 5 years Siri will outperform all other AI systems because the backends will be far more powerful.

I’m not seeing where your faith in Apple’s ability to overcome obstacles that companies having far greater knowledge and experience in information science, library science, classification systems, and understanding temporal and contextual associative relationships comes from. Google and Amazon have far more experience and expertise in these areas, many of which have been evolving since the 17th century. The scientific and mathematical basis for some of the association and classification algorithms were derived well before computers were available to actually execute the logic in real time and at scale. It’s similar to many of the advancements in computer architecture which were designed in the 1950s and 1960s but only became relevant and usable when the implementations could be done at VLSI levels at the micro and nano level.

In simple terms, coming up with solutions using AI, ML, big data, etc., isn’t a matter of cleverly pulling a rabbit out of a hat. The “hat” that hosts the rabbit is very deep in sciences that have existed for quite some time, hundreds of years in some cases, and requires specific domain experience and knowledge that has not been the primary focus of device makers, unless the device is merely a portal into the services that are generating the revenue.

Apple has been a product company that sells devices. They are relatively new to the services businesses that lean on the technologies contributing to AI and those that require access to big data. Apple has purposely limited its ability to delve too deeply into associating privileged data that it receives from its customer base, which puts them behind the curve in many ways compared to say Google or Amazon. I don’t think we need to come up with rationale to explain Siri’s slow development, much less portray it as if Apple is “holding back” to make sure they have it “perfected” before showing off its “true magic” to the world. Apple is still learning, plain and simple. They didn’t invent the technology, they bought it. That’s perfectly fine. But now that it’s theirs they’ve been doing their homework to understand the science behind it better and see what kind of Apple-specific secret sauce they can add to it without stepping on their customer’s toes, like some other companies have no problem doing.

The whole notion of Apple having any overwhelming competitive advantage on “back end” technology, much less implementation, is wildly optimistic. They aren’t close to being in that position now, but who knows, they may get there eventually. They have a massive number of very smart people and nearly unlimited resources.

ariannefeldry · February 15, 2023 6:47PM

It's going to revolutionize how we interact with search. I fed it a simple program I made in Python and asked for it to condense it. It threw me a program a few lines shorter, and had comments for each line it changed.

Because I had parameters for "Desktop" and the like it had a suggestion to change it to mobile. I clicked on it and the same program came up, except it replaced "Desktop" with "Mobile" and I noticed the URL parameters in my program had some more data. I asked it to explain what it changed and it told me that the site should recognize the requests I send as coming from a mobile device, and that the text file could be renamed back to the desktop variant if I didn't already have a mobile variant created.

I then asked it to find what Microsoft plans offered a specific feature as Microsoft is always changing terminology and shifting stuff around. The change I wanted was made in the last two months or so and it brought it up with no problems.

This isn't a replacement for search where you can go "oh, I can trust everything it says!". It's a new tool to be used. You have to use the same diligence you used when searching before. Trust, but verify.
