ccclemenfff/AVL
Visual Question Answering
Transformers
Safetensors
OpenGVLab/VideoChat2-IT
Lin-Chen/ShareGPT4V
liuhaotian/LLaVA-Instruct-150K
English
videollama2_mistral
text-generation
multimodal large language model
large video-language model
arxiv: 2406.07476
arxiv: 2306.02858
License: apache-2.0
Branch: main
Path: AVL/videollama2/model
Total size: 58.4 kB
2 contributors
History: 3 commits
Latest commit: ccclemenfff, "change flash_attention_2 to eager" (142339c), 6 months ago
File                    Size      Last commit message                  Updated
__init__.py             9.79 kB   change flash_attention_2 to eager    6 months ago
encoder.py              5.46 kB   change flash_attention_2 to eager    6 months ago
projector.py            8.83 kB   Add videollama2 model code           6 months ago
videollama2_arch.py     12.9 kB   Add videollama2 model code           6 months ago
videollama2_llama.py    5.36 kB   Add videollama2 model code           6 months ago
videollama2_mistral.py  5.46 kB   Add videollama2 model code           6 months ago
videollama2_mixtral.py  5.29 kB   Add videollama2 model code           6 months ago
videollama2_qwen2.py    5.31 kB   Add videollama2 model code           6 months ago