OpenAI announces SearchGPT: its new AI search engine

schizoidman@lemmy.ml · 4 months ago

OpenAI announces SearchGPT: its new AI search engine

helenslunch@feddit.nl · edit-2 20 days ago

deleted by creator

featured [he/him]@lemmygrad.ml · 4 months ago

It’s not doing live queries at all, it just makes a statistically likely answer up from its training data

helenslunch@feddit.nl · edit-2 20 days ago

deleted by creator

featured [he/him]@lemmygrad.ml · 4 months ago

I mean yeah it does include data scraped from the web but that is all three years old at this point. Hardly a search engine by any metric

helenslunch@feddit.nl · edit-2 20 days ago

deleted by creator

featured [he/him]@lemmygrad.ml · 4 months ago

It literally doesn’t do that

helenslunch@feddit.nl · edit-2 20 days ago

deleted by creator

Xavienth@lemmygrad.ml · 4 months ago

This is like saying the library search engine and Bob the drunkard who looked at the shelf labels and swears up and down he knows where everything is are the same thing.

Look, ChatGPT is an averaging machine. Yes it has ingested a significant chunk of the text on the internet, but it does not reproduce text exactly as it found it, it produces an average of all the text it has seen, weighted towards what seems like it make sense for the situation. For really common information this is fine. For niche information, it is bullshitting without any indication.

helenslunch@feddit.nl · edit-2 20 days ago

deleted by creator

Xavienth@lemmygrad.ml · 4 months ago

ChatGPT is not a search engine, it generates predictions on what is the most likely text completion to your prompt. It does not pull information from a database. It is a mathematical model. Its weights do not contain the training data. It is not indexing anything. You will not find any page from the internet in the model. It is all averaged out and any niche detail is lost, overpowered by more prevalent but less relevant training data. This is why it bullshits. When it bullshits it is not because it searched for something and came up empty, it is because in the training data there simply was not a sufficient number of occurrences of the answer to influence its response against the weight of all the other more prevalent training data. ChatGPT does not search anything.

helenslunch@feddit.nl · edit-2 20 days ago

deleted by creator

Xavienth@lemmygrad.ml · 4 months ago

The information it generates comes from the model. The information from the model comes from the internet. The information it generates does not come from the internet. A to B to C, not A to C. I don’t know how to explain this more simply without crayons, the information from the internet does not exist within the model, but the average of the information can be recreated by the model. That is not what a fucking search engine does. A search engine doesn’t tell you the average results for your query, it gives you the most relevant results. At least, they should and used to. I can understand the confusion if you’ve only used a search engine in the past 3 years.

helenslunch@feddit.nl · edit-2 20 days ago

deleted by creator

gerryflap@feddit.nl · 4 months ago

From the train dataset that was frozen many years ago. It’s like you know something instead of looking it up. It doesn’t provide sources, it just makes shit up based on what was in the (old) dataset. That’s totally different than looking up the information based on what you know and then using the new information to create an informed answer backed up by sources