Title: LLaVA-NeXT: Improved reasoning, OCR, and world knowledge | LLaVA
Open Graph Title: LLaVA-NeXT: A Strong Zero-shot Video Understanding Model
X Title: LLaVA-NeXT: A Strong Zero-shot Video Understanding Model
Description: LLaVA team presents LLaVA-NeXT, with improved reasoning, OCR, and world knowledge. LLaVA-NeXT even exceeds Gemini Pro on several benchmarks.
Opengraph URL: https://llava-vl.github.io/blog/2024-04-30-llava-next-video/
Generator: Jekyll v3.9.4
Domain: llava-vl.github.io
{
"@context": "https://schema.org",
"@type": "BlogPosting",
"author": {
"@type": "Person",
"name": "Yuanhan Zhang, Bo Li, Haotian Liu, Yong Jae Lee, Liangke Gui, Di Fu, Jiashi Feng, Ziwei Liu, Chunyuan Li"
},
"dateModified": "2024-01-30T12:33:38-06:00",
"datePublished": "2024-01-30T12:33:38-06:00",
"description": "LLaVA team presents LLaVA-NeXT, with improved reasoning, OCR, and world knowledge. LLaVA-NeXT even exceeds Gemini Pro on several benchmarks.",
"headline": "LLaVA-NeXT: Improved reasoning, OCR, and world knowledge",
"mainEntityOfPage": {
"@type": "WebPage",
"@id": "https://llava-vl.github.io/blog/2024-01-30-llava-next/"
},
"url": "https://llava-vl.github.io/blog/2024-01-30-llava-next/"
}
| None | IE=edge |
| author | Yuanhan Zhang, Bo Li, Haotian Liu, Yong Jae Lee, Liangke Gui, Di Fu, Jiashi Feng, Ziwei Liu, Chunyuan Li |
| og:locale | en_US |
| og:site_name | LLaVA |
| og:type | article |
| article:published_time | 2024-04-30T12:33:38-06:00 |
| twitter:card | summary |
Links:
Viewport: width=device-width, initial-scale=1