• 0 Posts
  • 17 Comments
Joined 2 years ago
cake
Cake day: June 11th, 2023

help-circle







  • Synthetic data is still ultimately built on raw data

    So they’re still feeding LLMs their own slop, got it.

    includes lots of curation steps to filter it for quality

    Ah, so it’s going back to the good old days of curated directories like Yahoo. Of course, because that worked so well.

    I don’t know what you mean by "a replacement for search engines.

    I mean that they’re discontinuing search engines in favour of LLM generated slop. Microsoft just announced it was shutting down the Bing APIs, in favour of Copilot. Google are shoving LLM generated nonsense all over their search. People are asking LLMs questions instead of looking them up in search engines because they’ve been sold the fantasy that you can get useful information out of that shit when it’s evident that all you get is information shaped hallucinated garbage (also because search engines have been intentionally enshittified to the point of being almost as useless). People are being sold dangerous nonsensical misinformation and being told it’s factual information. That’s what I mean.

    there’s still a search engine providing it with sources to generate that summary from

    No there’s not, that’s not how LLMs work, you have to retrain the whole model to get any new patterns into it.

    Even if you stick the LLM between an actual search engine and the user, it just becomes a perverted game of telephone, with the LLM mangling the user’s prompt into a search prompt that almost certainly will have nothing to do with what the user wanted, which will be fed into the aforementioned enshittified search engine, whose shitty useless results will be fed back into the LLM, which will use them to hallucinate some answer (with inexistent references and all) that will look like an answer to the user’s question (if LLMs are good at anything it’s brainwashing their victims into believing that their answers are correct) while having no bearing whatsoever in reality.

    The tragic fact is that LLM’s offer practically no benefits over 40 year old Eliza if you gave it a fraction of the data and computational power they need, while being many orders of magnitude more expensive and resource intensive.

    They have no affordable practical applications whatsoever, and the companies selling them are so desperate to earn back the investment and run off with the money before the bubble bursts and everyone realises that the emperor has been hanging his shriveled little dong in front of our faces the whole time that they’re shoving this shit everywhere (notepad!? fucking seriously!?) whether it makes sense or not, burning off products that used to work, and the Internet itself, and replacing them with useless LLM infected shit so their customers have no option but to buy their useless massively overpriced garbage.







  • At one particular point it was, if I recall correctly, though Chrome also (mis)implements some standards its own way, so Google might also use that as a form of attack against anyone who implements them properly, much like Microsoft did in the bad old IE6 days…

    It’s all a silly arms race, though, with Google coming up with new ways to enshittify the web for anyone not using Chrome or using ad blockers and Mozilla and ad blocker (and alternative YouTube frontend) developers trying to figure out what they broke this time and how to fix it, so what worked yesterday might not work today and work again tomorrow.

    It’s all a profoundly stupid waste of everyone’s time and resources (all for a few more ad views) which will hopefully end up with Google losing their monopoly position on the web like the Internet Explorer bullshit did for Microsoft, but will keep being a major hassle for everyone until it does.