Mastodon @Mastodon@mastodon.social

**eicker.news tech news** @technews@eicker.news · Mar 2

eicker.news tech news @technews@eicker.news

»#AI companies race to use #distillation to produce #cheapermodels: DeepSeek used technique to create smaller powerful models based on the technology of competitors such as Meta.« https://www.ft.com/content/c117e853-d2a6-4e7c-aea9-e88c7226c31f?eicker.news #tech #media

**PUPUWEB Blog** @pupuweb@mastodon.social · Mar 2

Mar 2

PUPUWEB Blog @pupuweb@mastodon.social

DeepSeek, OpenAI & others are advancing faster with smaller, powerful models using 'distillation,' challenging the LLM first-mover advantage. Could this shift the landscape? #AI #MachineLearning #LLMs #OpenAI #DeepSeek #Distillation #TechInnovation

**Knowledge Zone** @kzoneind@mstdn.social · Feb 28

Feb 28

Knowledge Zone @kzoneind@mstdn.social

#ITByte: #LLM #Distillation is a technique used to create smaller, more efficient versions of large language models (LLMs).

It involves training a smaller "student" model to mimic the behavior and knowledge of a larger "teacher" LLM.

https://knowledgezone.co.in/posts/LLM-Distillation-67b2ea21ebfe6b268acbf091

Replied in thread

**Emmie Hine** @emmiehine@dair-community.social · Feb 10

Feb 10

Emmie Hine @emmiehine@dair-community.social

The latest AI buzzword? #Distillation
- #OpenSource AI models are rapidly advancing—by copying closed models.
- Companies like OpenAI & Google hate it, but they can’t stop it.
- The big question: Is open-source really catching up to proprietary AI?

**Nicole Hennig** @nic221@techhub.social · Feb 7

Feb 7

Nicole Hennig @nic221@techhub.social

DeepSeek’s R1 and OpenAI’s Deep Research just redefined AI — RAG, distillation, and custom models will never be the same https://venturebeat.com/ai/deepseeks-r1-and-openais-deep-research-just-redefined-ai-rag-distillation-and-custom-models-will-never-be-the-same/ #AI #RAG #distillation

Text Shot: This distillation of models plus RAG is where the magic will come for most companies. It has become so incredibly easy to do, even for those with limited data science or coding expertise. I personally downloaded the DeepSeek distilled 1.5b Qwen model, the smallest one, so that it could fit nicely on my Macbook Air. I then loaded up some PDFs of job applicant resumes into a vector database, then asked the model to look over the applicants to tell me which ones were qualified to work at VentureBeat. (In all, this took me 74 lines of code, which I basically borrowed from others doing the same).

I loved that the Deepseek distilled model showed its thinking process behind why or why not it recommended each applicant — a transparency that I wouldn’t have gotten easily before Deepseek’s release.

**Nicole Hennig** @nic221.bsky.social@bsky.brid.gy · Feb 7

Feb 7

Nicole Hennig @nic221.bsky.social@bsky.brid.gy

**Habr** @habr@zhub.link · Feb 3

Feb 3

Habr @habr@zhub.link

Поднимаем DeepSeek llm локально

Все уже слышали про новую модель deepseek r1, которая обогнала по бенчмаркам openai. Компания Deepseek выложила веса и дистилляты в открытый доступ, благодаря чему мы можем их запустить. В статье поднимем дистилляты модели r1 используя llama.cpp - потребуются лишь базовые умения работы с bash, docker и python. Самостоятельный запуск проще простого.

https://habr.com/ru/articles/878836/

ХабрПоднимаем DeepSeek llm локальноВсе уже слышали про новую модель DeepSeek r1, которая обогнала по бенчмаркам openai. Компания DeepSeek выложила веса и дистилляты в открытый доступ, поэтому мы можем их запустить. В статье поднимем...

#llm #deepseek #unsloth

**卡拉今天看了什麼** @ai_workspace@social.mikala.one · Jan 29

Jan 29

卡拉今天看了什麼 @ai_workspace@social.mikala.one

OpenAI已掌握證據「DeepSeek侵權」，盜用GPT distillation技術訓練中國AI | 動區動趨-最具影響力的區塊鏈新聞媒體

Link

Summary: OpenAI 揭露中國 AI 公司 DeepSeek 涉嫌使用其專有技術進行模型訓練，可能違反智慧財產權。DeepSeek 的開源多模態模型「Janus-Pro」迅速引起市場關注，影響美國 AI 巨頭股價。微軟協助 OpenAI 調查，已發現 DeepSeek 利用 OpenAI 的 API 進行了不當的 distillation 技術。OpenAI 表示將採取反制措施以保護其技術，但因該技術在業界普遍使用，維權之路面臨挑戰。

Key Points:
- OpenAI 發現 DeepSeek 使用其專有模型進行訓練，疑似侵犯智慧財產權。
- DeepSeek 推出的「Janus-Pro」模型以低成本、高性能威脅美國 AI 市場。
- 微軟已協助 OpenAI 進行調查，查明 DeepSeek 透過 API 進行違規行為。
- OpenAI 將加強維權並與政府合作，但維權措施可能面對許多挑戰。
- OpenAI 本身也面臨其他著作權訴訟。

Keywords: #OpenAI #DeepSeek #智慧財產權 #AI #distillation

動區動趨 BlockTempo · Jan 29OpenAI已掌握證據「DeepSeek侵權」，盜用GPT distillation技術訓練中國AIBy Editor Jr.

**Infrogmation** @Infrogmation · Dec 17, 2024

Dec 17, 2024

Infrogmation @Infrogmation

MY NEWS: Team makes distilled wine in replica of bronze vessel found at emperor’s tomb

Archaeologists conduct experiment to recreate distillation process from Western Han dynasty using replica still and taro

https://www.scmp.com/news/china/science/article/3290709/team-makes-distilled-wine-replica-bronze-vessel-found-emperors-tomb

South China Morning Post · Dec 13, 2024Team makes distilled wine in replica of bronze vessel found at emperor’s tombArchaeologists conduct experiment to recreate distillation process from Western Han dynasty using replica still and taro.

#Archaeology #China #distillation

**DynoFlux** @nflux@exile.social · Nov 25, 2024 *

Nov 25, 2024 *

DynoFlux @nflux@exile.social

Last night's reactor setup worked pretty well. It based on a big tea urn which has some water in and the actual still placed inside it, with the tubing coming out through the lid and is effectively a bain-marie type setup so shouldn't be able to scorch the mash, which means we should be able to leave any fruit in for additional flavour (on runs which have fruit, like for my apple molotov) #distillation #moonshine

A large cylindrical vessel, wrapped in silver bubble wrap, with some flexible stainless steel tubes, leading to a smaller vessel (doubler), and then to a small condenser

**michabbb** @michabbb@vivaldi.net · Oct 19, 2024

Oct 19, 2024

michabbb @michabbb@vivaldi.net

A very interesting interview....

#OpenAI CPO #KevinWeil on the Future of #AI

Summary:

#ProductManagement at #OpenAI: Kevin Weil discusses differences in product management compared to other companies, highlighting that AI advancements make product development more dynamic and unpredictable.

AI's Rapid Evolution: #AI technology evolves faster than ever, with computers gaining new capabilities monthly, which requires adaptability and quick decision-making in #productstrategy.

#Distillation and #CostReduction: Distillation is a key innovation that allows AI models to be optimized for specific tasks, significantly reducing costs (99% cost reduction since #GPT3).

Advanced Reasoning with #01Model: The 01 model introduces advanced reasoning capabilities, enabling AI to hypothesize and refine its thinking, which is particularly effective in fields like #science and #mathematics.

Insights Based on Numbers:

99% cost reduction in two years: The evolution from #GPT3 to #GPT4 models shows a dramatic decrease in cost, making AI more accessible and practical for diverse applications.

More than 3 million developers using OpenAI: This showcases the wide adoption of OpenAI's tools and the potential impact of AI across industries.

https://youtu.be/VsmEMUiPXIs?si=rkB7_ECm71BH59sf

youtu.be- YouTubeEnjoy the videos and music you love, upload original content, and share it all with friends, family, and the world on YouTube.

Continued thread

**Susan Larson** @Susan_Larson_TN · Oct 8, 2024

Oct 8, 2024

Susan Larson @Susan_Larson_TN

The Boundless #Destruction of #DonaldTrump’s #MassDeportation #Plan.

His #solution to the #housingcrisis is the perfect #distillation of #Trumpism: It’s mind-blowingly #evil, and it has no hope of #working.

https://newrepublic.com/article/186686/trump-vance-mass-deportation-plan

The New Republic · Oct 8, 2024The Boundless Destruction of Donald Trump’s Mass Deportation PlanHis solution to the housing crisis is the perfect distillation of Trumpism: It’s mind-blowingly evil, and it has no hope of working.

**Habr** @habr@zhub.link · Oct 3, 2024

Oct 3, 2024

Habr @habr@zhub.link

Всем про LLM. Как рассказать про трансформеры одинаково хорошо и индустриалам, и исследователям

Привет, Хабр. Меня зовут Вика, я работаю в AIRI, преподаю в Школе Анализа Данных и Сколтехе и вместе со своими коллегами занимаюсь обработкой естественного языка, изображений и видео, а также иными задачами, где могли бы пригодиться трансформерные модели. Трансформерные архитектуры — очень мощное орудие, которые может быть применено почти во всех сферах DL, и интереснейший концепт, в котором много потенциала для исследования. А, главное, их очень легко применить к технологиям, которые способны изменить нашу жизнь здесь и сейчас. На словах всё красиво. Но три года назад мы заметили, что и магистры, и работники индустрии, связанной с AI, часто просят «объяснить, как же все‑таки работают трансформеры, потому что из научной статьи ничего не понятно». Так происходит из‑за того, что многое, что в статьях считается очевидным и само собой разумеющимся, очень плохо разъясняется в учебной литературе или существующих курсах. Как следствие, многие не могут использовать трансформеры для решения практических задач и реализации своих идей. Эта трудность побудила нас создать полноценный курс по трансформерам, в котором проработаны такие проблемные точки и который адаптирован для студентов с разным профессиональным бэкграундом. О нём я и расскажу в этой статье. Мы уже апробировали курс на лекциях в Сколтехе, МГУ и Сбер Университете, и написали в AIRI о нём статью , которую представили на воркшопе по преподаванию на одной из самых популярных мировых конференций по NLP — ACL-2024. Материалы академической версии курса можно найти в нашем репозитории . Приятного чтения!

https://habr.com/ru/companies/airi/articles/847348/

ХабрВсем про LLM. Как рассказать про трансформеры одинаково хорошо и индустриалам, и исследователямПривет, Хабр. Меня зовут Вика, я работаю в AIRI, преподаю в Школе Анализа Данных и Сколтехе и вместе со своими коллегами занимаюсь обработкой естественного языка, изображений и видео,...

#трансформеры #преподавание #llm