Fundamental Vision Lab

Publications

We try our best to do research with long-term impact.

2024

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Zhe Chen, Weiyun Wang, Hao Tian, Shenglong Ye, Zhangwei Gao, ..., Tong Lu, Dahua Lin, Yu Qiao, Jifeng Dai, Wenhai Wang
arXiv Tech Report 2024 · 01 May 2024 · arXiv:2404.16821

2023

ControlLLM: Augment Language Models with Tools by Searching on Graphs
Zhaoyang Liu, Zeqiang Lai, Zhangwei Gao, Erfei Cui, Ziheng Li, ..., Lewei Lu, Qifeng Chen, Yu Qiao, Jifeng Dai, Wenhai Wang
arXiv Tech Report 2023 · 19 Dec 2023 · arXiv:2310.17796
