• Alphawise
  • Posts
  • Deepseek & Silicon Valley's reaction; Huggingface will rebuild to verify open source

Deepseek & Silicon Valley's reaction; Huggingface will rebuild to verify open source

View today's featured podcast with DeepMind CEO on an optimistic view of the world in the next 5 years

A technical AI newsletter
written with an entrepreneurial spirit for builders

What is today’s beat?

NEWSROOM


FOLLOW US ON SOCIAL FOR MORE


Your FREE newsletter
 share or subscribe 
to show support

🧨 Deepseek R1 🧨

Why is Silicon Valley Excited, Upset, and Motivated?

Yann LeCun - LinkedIn

DeepSeek's R1 model has been launched as a significant competitor to OpenAI's O1, leveraging reinforcement learning to achieve comparable performance at one-third the cost. This development highlights a turning point in the open-source versus proprietary AI landscape, emphasizing affordability and accessibility.

  • Who is the company?
    Hong Kong-based quantitative analysis (quant) firm High-Flyer Capital Management started investing in Nvidia GPUs with its extra capital. It turned tech with investment to create their model!

  • Resources to build
    Deepseek built Deepseek R1, a direct competitor to OpenAI’s o1 model, at 200x less compute budget for a total of $6M USD for its compute budget. This is what it takes to build a state-of-the-art world class model that is free for everyone to use (with daily cap on usage).

  • Benchmarks
    Google Gemini is currently first, and Deepseek R1 and OpenAI o1 are tied for 3rd showing near similar results in all areas. Except Deepseek R1 is 98% cheaper to use than OpenAI o1. So, go ahead and cancel that OpenAI account of yours, and enjoy Deepseek for FREE for daily use (up to 1M tokens per day).

  • Open Source
    Code, training data set, and guidelines to train it yourself make this an absolute smash hit across all industries. This truly democratizes AI with efforts like this.

DeepSeek R1 represents a critical shift in the AI industry, emphasizing cost-effective innovation and open-source collaboration. We’ll just finish off with a comment that Yann LeCun - the Chief AI Scientist for Meta - posted on on LinkedIn.


“To people who see the performance of DeepSeek and think: ‘China is surpassing the US in AI.’ You are reading this wrong. The correct reading is: ‘Open source models are surpassing proprietary ones.’

⭐️ SPECIAL HIGHLIGHT ⭐️

Huggingface will rebuild Deepseek R1

Instructions to participate in link below

HuggingFace is redoing the training process to validate that Deepseek R1 is truly open source, and you can follow their progress here on Github.

🗞️ NEWSROOM 🗞️ 

What’s hot in tech right now?

How Govt Could Reshape IP And Small Business In The US
A CrunchBase article that examines how the Trump administration's IP policies shaped challenges and opportunities for small businesses, focusing on patent protections and innovation dynamics.

Meet the startup shaking up UK education AI policy: Faculty AI
It seems the UK is taking a different long-term approach (and slightly under the radar) with education. A look at AI policy innovation in UK education, tackling ethical concerns, equitable access, and AI-driven teaching tools to reshape learning experiences.

AI Maps Titan’s Clouds with Nvidia Tech
Explores how AI and deep learning models unravel the mysteries of Titan's hazy, methane-rich clouds using advanced GPU-powered simulations and data from NASA's Cassini mission.

Kodiak's Driverless Trucks Deliver for Atlas Energy
Covers Kodiak Robotics' milestone of completing its first driverless truck deliveries, marking a significant step in autonomous logistics for the energy sector.

⭐️ BUILDER BYTES ⭐️

What’s hot for builders right now?

Smol Agents Get a Vision Boost
Huggingface Blog explores how Smol Agents integrate vision capabilities, enabling multimodal tasks like image-based question answering and planning. Note, this is part of the world’s smallest large vision models released by HuggingFace.

VideoLLaMA: Multimodal AI for Video Understanding
An Alibaba based research introduces a multimodal AI model combining video and language understanding, enabling tasks like video-based Q&A, summarization, and captioning.

LLAMA Stack: Meta's AI Dev Toolkit
Meta research completes its first complete, comprehensive open source framework releases - Llama stack. View details on training, fine-tuning, and deploying large language models, emphasizing modularity and scalability.

Llasa-3B: Speech-Aware Language Modeling
Top Hong Kong researcher releases trained from scratch TTS model. Showcases a 3B parameter model optimized for integrating speech and text inputs, designed for speech-driven AI applications like transcription and voice assistants with code.

🤩 COMMUNITY 🤩

What’s the latest beat?

Podcast

⭐️ Today’s favorite - very insightful and intriguing for all!

Big Technology Podcast with Google Deepmind CEO
Demis Hassabis discusses major topics in AI, with skepticism and an optimistic view that touches on subjects like AGI, deceptive agents, and virtual cell simulations to help advance healthcare.

Podcast

Google Deepmind PodCast- exploring project Astra
Google gives a deepdive into their project Astra, and its ambitions to make a universal AI assistant (and it is currently available). A great look into how Google’s technology suite will change its business centric platforms like GSuite, Gmail, Docs, and more.

Podcast

StackOverflow: How the internet changed in 2024
John Graham-Cumming, CTO of Cloudflare, joins Ben and Ryan for a conversation about the latest trends in internet usage highlighted in Cloudflare's 2024 Year in Review report.

THANK YOU

👇️ we are 100% free, so please let us know what you think! 👇️ 

Looking to promote your company, product, service, or event?