- Alphawise
- Posts
- 2024-12-28
2024-12-28
Alibaba Qwen-VQ - a multimodal heavy weight

A technical AI newsletter - written with an entrepreneurial spirit for builders
Welcome to your daily newsletter on AI
What have we got for you today?
Alibaba Qwen-VQ, a multimodal heavyweight, released
Devin.ai has version 1.1 released - an AI Agent for hire that codes.
Supabase tutorial - spin up a javascript chatbot in 5 minutes
We are 100% free!
And with your support, we create more FREE content!
Please share with a friend, or 10!
🎯 RELEASES 🎯
Bringing insights into the latest trends and breakthroughs in AI
Alibaba
Qwen-VQ 72B - a multimodal heavy weight
Synopsis
The Qwen-VQ 72B model represents a significant step forward in multimodal AI, combining powerful language and visual understanding capabilities. Designed for complex real-world applications, this model enhances performance across diverse tasks, from image analysis to advanced language comprehension, setting a new benchmark in AI versatility.
Core Observations
The Qwen-VQ 72B integrates language and vision processing, enabling tasks like image-to-text generation and multimodal Q&A with state-of-the-art accuracy. Built with 72 billion parameters, it one of the largest multimodal models available.
Multimodal Capabilities:
Demonstrates 85.6% accuracy on multimodal tasks such as visual reasoning and captioning benchmarks, outperforming previous models by over 7%.
Scale and Architecture:
Trained on a 5 trillion-token dataset combining diverse text and visual inputs, ensuring superior generalization.
Benchmark Performance:
Achieved a score of 91.4% on Visual Commonsense Reasoning (VCR), surpassing comparable models by 8 points.
Outperformed competing systems on text-heavy benchmarks with 92.2% accuracy on SQuAD 2.0 and 89.3% on Natural Questions (NQ).
Pretraining and Fine-Tuning:
Uses multi-stage pretraining, optimizing for multimodal inputs without sacrificing efficiency.
Incorporates adaptive sparse attention mechanisms, reducing computational overhead by up to 30% compared to dense architectures.
Efficiency and Scalability:
Delivers faster inference speeds, processing multimodal queries at 50 tokens per second, a 25% improvement over previous-generation models.
Broader Context
Qwen-VQ 72B demonstrates Alibaba’s ability to compete with top LLM providers like OpenAI and Google. While OpenAI leads with GPT-4’s general versatility, and Google excels in R&D with models like Gemini, Qwen-VQ 72B’s multimodal focus carves a niche in combining language and vision capabilities.
Devin.ai
AI Coder for Hire - v 1.1 released
Synopsis
Devin is basically an AI agent that contributes to your code base - so think of it as an AI coding agent for hire! Cognition Labs has unveiled Devin 1.1, an upgraded AI model designed for code-editing tasks. The model delivers an increase in performance speed and reduced operational costs, making waves as a tool for developers and businesses optimising their software workflows
Core Observations
Performance Improvements:
Operates 10% faster than its predecessor
Cost Reductions:
Operational costs for code-editing tasks have decreased by 12%
Enhanced Accuracy:
Improvements to language understanding and error correction reduce debugging times by up to 15%
Scalable Deployment:
Optimized for integration into CI/CD pipelines, enabling large-scale adoption in enterprise environments.
Broader Context
Cognition Labs’ Devin 1.1 positions the company as a competitive player in the AI-driven development tools market, rivaling OpenAI Codex and GitHub Copilot. While top providers focus on feature-rich models, Devin 1.1 emphasizes speed, cost-efficiency, and accuracy, making it a practical choice for industry adoption.
Try it out here
⚙️ BUILDERS BYTES ⚙️
Informing builders of latest technologies and how to use them
What will you learn today?
Learn how to build a AI chatbot using Supabase as the database backend. This is a javascript tutorial that mostly shows how to use the technology (less emphasis on programming).

the web app
Key Takeaways
Supabase Integration: Use Supabase for secure, scalable data storage and real-time interactions with your chatbot.
Vercel Deployment: Deploy a fast, serverless AI chatbot on Vercel with minimal setup and configuration.
OpenAI Integration: Integrate OpenAI models to power intelligent, natural-language responses.
Code Example Provided: Hands-on example with clear steps for connecting Supabase and Vercel in a chatbot project.
Scalable Architecture: Leverage Supabase and Vercel for a cost-effective, production-ready AI chatbot solution.
$ git clone https://github.com/supabase-community/vercel-ai-chatbot
$ npm supabase start
$ npm install
$ npm run dev
We just wanted to show you a snippet for now. The full tutorial is available in our newsletter repo 👉️ code
Do you have a product in AI and would like to contribute?
👉️ email us: [email protected]
Is there something you’d like to see in this section?
👉️ share your feedback
Trending
🤩 COMMUNITY 🤩
Cultivating curiosity with latest in professional development
Talks & Events
THANK YOU
Our Mission at AlphaWise
AlphaWise strives to cultivate a vibrant and informed community of AI enthusiasts, developers, and researchers. Our goal is to share valuable insights into AI, academic research, and software that brings it to life. We focus on bringing you the most relevant content, from groundbreaking research and technical articles to expert opinions to curated community resources.
Looking to connect with us?
We actively seek to get involved in community with events, talks, and activities. Email us at [email protected]