Mastodon @Mastodon@mastodon.social

**Kagan MacTane (he/him)** @kagan@wandering.shop · 10h

One thing that makes me very happy about #NaNoWriMo crashing and burning: It's good evidence that the #writing world is not even remotely buying into the AI hype. (Unlike another industry I'm involved in... )

Having poor moderation practices that enabled grooming? That was bad for them. But promoting AI? That *killed* them. Writers are going: "AI? Fuck, no!"

#AI #AICrap #AIHype

**bartosz** @btel@mastodon.social · 1d

Who could have predicted this? State-of-the-art LLMs score 5% on the 2025 mathematical olympiad despite having been trained extensively on past editions:

https://arxiv.org/abs/2503.21934

arXiv.org · Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad

Recent math benchmarks for large language models (LLMs) such as MathArena indicate that state-of-the-art reasoning models achieve impressive performance on mathematical competitions like AIME, with the leading model, o3-mini, achieving scores comparable to top human competitors. However, these benchmarks evaluate models solely based on final numerical answers, neglecting rigorous reasoning and proof generation which are essential for real-world mathematical tasks. To address this, we introduce the first comprehensive evaluation of full-solution reasoning for challenging mathematical problems. Using expert human annotators, we evaluated several state-of-the-art reasoning models on the six problems from the 2025 USAMO within hours of their release. Our results reveal that all tested models struggled significantly, achieving less than 5% on average. Through detailed analysis of reasoning traces, we identify the most common failure modes and find several unwanted artifacts arising from the optimization strategies employed during model training. Overall, our results suggest that current LLMs are inadequate for rigorous mathematical reasoning tasks, highlighting the need for substantial improvements in reasoning and proof generation capabilities.

#ai #AIhype #llm

**Robert Kingett** @WeirdWriter@caneandable.social · 3d

Do LLMs Really Understand? https://www.youtube.com/watch?v=YtIQVaSS5Pg #AI #AIHype #Debate #Tech #Technology

YouTube · CHM Live | The Great Chatbot Debate: Do LLMs Really Understand? · By Computer History Museum

**Hyperactivemike** @hyperactivemike@mastodon.social · 3d

Seeing what ChatGPT, Claude and Gemini can do today, I think that soon AI-generated products and services will be fighting with other AI-generated products and services.

Very interesting times await us.

How to find your way in this? I have an idea - I will become an ascetic, or some slow-life farmer. #AIHype

**Bornach** @bornach@masto.ai · 3d

Ignore the techbro grifters peddling #AIhype and promising #AGI real-soon-now

We haven't even solved the basic problems of AI reasoning

https://youtu.be/vpNmKN2szt8

[Discover AI] looks at recent publications on the problems #AI #LLM reasoning research are now encountering

#GenerativeAI #overthinking

**jbz** @jbz@indieweb.social · 3d

Alibaba’s Tsai Warns of ‘Bubble’ in AI Data Center Buildout

“I start to see the beginning of some kind of bubble,” he told delegates to the summit. Some of the envisioned projects commenced raising funds without having secured “uptake” agreements, he added. “I start to get worried when people are building data centers on spec. There are a number of people coming up, funds coming out, to raise billions or millions of capital.”

https://finance.yahoo.com/news/alibaba-tsai-warns-bubble-ai-020549819.html

Yahoo Finance · Mar 25 · Alibaba’s Tsai Warns of ‘Bubble’ in AI Data Center Buildout · By Luz Ding

#ai #aihype #bubble

**Robert Kingett** @WeirdWriter@caneandable.social · Mar 27 *

This article solidifies why I never take these #Accessibility businessmen/professionals seriously. First of all, for an accessibility practitioner, he doesn’t understand the technology he is writing about. A longer blog post will be coming, but this is why I roll my eyes at accessibility professionals today. AI is the future of accessibility - Karl Groves https://karlgroves.com/ai-is-the-future-of-accessibility/ #A11y #AIHype #AI

Karl Groves - Web Accessibility Viking · Mar 23 · AI is the future of accessibility - Karl Groves

When I was a kid, my mom bought a 1979 Camaro Berlinetta. It was a deep blue color with a 350 cubic inch/ 5.7 liter engine. In 1979, that Camaro had 185 horsepower. The ‘79 Z/28 had 245 horsepower. In comparison, the 2023 Toyota Camry’s base engine is 2.5 liters with 203 horsepower. The sporty

**Miguel Afonso Caetano** @remixtures@tldr.nettime.org · Mar 25

"My core theses — The Rot Economy (that the tech industry has become dominated by growth), The Rot-Com Bubble (that the tech industry has run out of hyper-growth ideas), and that generative AI has created a kind of capitalist death cult where nobody wants to admit that they're not making any money — are far from comfortable.

The ramifications of a tech industry that has become captured by growth are that true innovation is being smothered by people that neither experience nor know how (or want) to fix real problems, and that the products we use every day are being made worse for a profit. These incentives have destroyed value-creation in venture capital and Silicon Valley at large, lionizing those who are able to show great growth metrics rather than creating meaningful products that help human beings.

The ramifications of the end of hyper-growth mean a massive reckoning for the valuations of tech companies, which will lead to tens of thousands of layoffs and a prolonged depression in Silicon Valley, the likes of which we've never seen.

The ramifications of the collapse of generative AI are much, much worse. On top of the fact that the largest tech companies have burned hundreds of billions of dollars to propagate software that doesn't really do anything that resembles what we think artificial intelligence looks like, we're now seeing that every major tech company (and an alarming amount of non-tech companies!) is willing to follow whatever it is that the market agrees is popular, even if the idea itself is flawed.

Generative AI has laid bare exactly how little the markets think about ideas, and how willing the powerful are to try and shove something unprofitable, unsustainable and questionably-useful down people's throats as a means of promoting growth.
(...)
In short, reality can fucking suck, but a true skeptic learns to live in it."

https://www.wheresyoured.at/optimistic-cowardice/

Ed Zitron's Where's Your Ed At · Mar 24 · The Phony Comforts of AI Optimism

A few months ago, Casey Newton of Platformer ran a piece called "The phony comforts of AI skepticism," framing those who would criticize generative AI as "having fun," damning them as "hyper-fixated on the things [AI] can't do." I am not going to focus too hard on this blog, in

#AI #GenerativeAI #BigTech

**Robert Kingett** @WeirdWriter@caneandable.social · Mar 25 *

Despite the hopeful tone this takes regarding LLMs and openly speculating if future LLM image descriptions will be better, this is a really great breakdown as to why AI is not capable of writing good image descriptions. Can generative AI write contextual text descriptions? - TetraLogical https://tetralogical.com/blog/2025/03/24/can-generative-ai-write-contextual-text-descriptions/ #AI #AltText #Accessibility #AIHype

TetraLogical · Can generative AI write contextual text descriptions? - TetraLogical

In 2025, Artificial Intelligence (AI) and Large Language Models (LLM) like ChatGPT, Gemini, Claude, and DeepSeek are being used for everything. Writing emails. Generating code. Even applying for jobs. But, can they write good text descriptions for images?

**Yuna** @LunaFreyja@hachyderm.io · Mar 21

AI Will Replace Engineers?
Oh, sweet summer child…

AI isn’t thinking. It’s autocomplete on steroids.
It guesses confidently, fails quietly, and we pretend it’s magic.

Engineers won’t be replaced.
They’ll be cleaning up AI’s mess.

But sure, dream of your AI CEO…
Full Post: https://www.linkedin.com/posts/yuna-morgenstern-6662a5145_ai-will-replace-engineers-oh-sweet-summer-activity-7308577691148980227-rRw1

An engineer asking 8-Ball where to put the next card on the house of cards

#AI #SoftwareEngineering #PromptEngineering

**jbz** @jbz@indieweb.social · Mar 19

The shovel store will now sell you the gold itself, or something along those lines, couldn't understand much more from his desperate tone.

https://www.youtube.com/watch?v=ZXxsASPC8Hs

YouTube · Nvidia CEO Huang Says AI Is at an Inflection Point · By Bloomberg Technology

#nvidia #ai #aihype

**Bornach** @bornach@masto.ai · Mar 17

Sam Altman asks #Trump to grant #OpenAI blanket permissions to steal everyone's copyrighted works so he can train his #AI. He raises the threat of China AI dominance as the bogeyman.
https://futurism.com/openai-over-copyrighted-work
#copyright #FairUse #AIhype #SamAltman

Futurism · Mar 16 · OpenAI Says It’s "Over" If It Can’t Steal All Your Copyrighted Work · By Noor Al-Sibai

**Bornach** @bornach@masto.ai · Mar 14

It is best not to trust the research summaries generated by these Deep Research AIs, according to Jordan Harrod

https://youtu.be/OdZq3DJSFHE

#AI #GenerativeAI #DeepResearch

**Carl Gold, PhD** @carl24k@sigmoid.social · Mar 12

Think current #LLM already empower ANYONE to code? Time to put down the kool aid... #aihype #learntocode #DataScience #MachineLearning

https://www.theguardian.com/games/2025/mar/11/ai-pac-man-clones-reviewed-grok

The Guardian · Mar 11 · ‘A lot worse than expected’: AI Pac-Man clones, reviewed · By Rich Pelley

**janhoglund** @janhoglund@mastodon.nu · Mar 9

Elon Musk’s 2019 prediction: “I feel very confident predicting that there will be autonomous robotaxis from Tesla next year… From our standpoint, if you fast forward a year, maybe a year and three months, but next year for sure, we’ll have over a million robotaxis on the road.”
https://garymarcus.substack.com/p/nobel-prizes-and-the-ai-hype-hall
#musk #ai #aihype

Marcus on AI · Mar 9 · Nobel Prizes and The AI Hype Hall of Fame · By Gary Marcus

**Teixi** @teixi@mastodon.social · Mar 9

We are living this decade of #aiHype lamebrain experts every year with imminent #AIRadiologists standalone replacement of #HumanRadiologists pushed out all over the news...

Uncurious to learn other fields #IroniesOfAutomation research

https://erictopol.substack.com/p/when-doctors-with-ai-are-outperformed

Last paper closer:

» Moreover, #radiologists take significantly more time to make a decision when #AI information is provided «

https://economics.mit.edu/sites/default/files/2023-07/agarwal-et-al-diagnostic-ai.pdf

#AutomationParadox #LessonsLearned in #Aviation #HumanFactors applied training

1/2

Ground Truths · Feb 2 · When Doctors With A.I. Are Outperformed by A.I. Alone · By Eric Topol

**Timnit Gebru (she/her)** @timnitGebru@dair-community.social · Mar 7

I haven't even used "foundation models" (still refuse) and now that seems out of fashion and we're at "frontier models."

The uncritical uptake of all these terms pushing #AIHype is incredible.

**Christian Mayer** @TheFox21@mastodon.social · Mar 6

What does "LLM Models" mean? — Large Language Model Models?
#ai #llm #llms #terminology #bullshit #ki #aihype

**janhoglund** @janhoglund@mastodon.nu · Mar 6

“There is a long, long, long way from writing basic reports to the kind of AI that could match the originality of top human scientists.”
—Gary Marcus, Ezra Klein’s new take on AGI – and why I think it’s probably wrong
#ai #agi #aihype

**janhoglund** @janhoglund@mastodon.nu · Mar 6

“…a quarter century has passed without a principled solution to even one of those problems.”
—Gary Marcus, Ezra Klein’s new take on AGI – and why I think it’s probably wrong
https://garymarcus.substack.com/p/ezra-kleins-new-take-on-agi-and-why
#agi #ai #aihype

Marcus on AI · Mar 5 · Ezra Klein’s new take on AGI – and why I think it’s probably wrong · By Gary Marcus
