Alibaba Cloud launches open source Large Vision Language Model Qwen-VL

nodetechno7 août 28, 2023

On August 25, Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7 billion parameter foundational language model Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios including image and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]

TechNode

Techno Node

Ticker

Alibaba Cloud launches open source Large Vision Language Model Qwen-VL

Enregistrer un commentaire

0 Commentaires

Subscribe Us

Popular Posts

Xiaomi CEO says 3nm Xuanjie O1 chip shipments surpass one million

Shaktikanta Das Reappointed as RBI Governor

Beijing forbids generative AI in online medical prescriptions

Toyota On The Charge With $13.5bn Electric Vehicle Battery Tech Pledge

Hyundai Taking the Hydrogen Route With Commercial Vehicles

POP MART’s LABUBU makes a surprise appearance at the 2026 FIFA World Cup opening ceremony

Japan Looking to Protect Rare Earth Resources From Takeover

China Property Sales Rebound As Mortgage Controls Ease

Asia Markets Boosted By Earnings Data But Fed Pullback Looms

China penalizes AI platforms over failure to label AI-generated content

Random Posts

Recent in Sports

Popular Posts

Xiaomi CEO says 3nm Xuanjie O1 chip shipments surpass one million

Shaktikanta Das Reappointed as RBI Governor

Beijing forbids generative AI in online medical prescriptions

Footer Menu Widget

Ticker

Ad Code

Alibaba Cloud launches open source Large Vision Language Model Qwen-VL

Ces posts pourraient vous intéresser

Enregistrer un commentaire

0 Commentaires

Social Plugin

Subscribe Us

Popular Posts

Random Posts

Recent in Sports

Popular Posts

Footer Menu Widget