'vision language model' 태그의 글 목록

250x250

Notice

Recent Posts

Recent Comments

Link

« 2025/04 »
일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30

Tags more

Archives

Today

Total

관리 메뉴

글쓰기
방명록
RSS
관리

목록vision language model (1)

My Vision, Computer Vision

[논문 리뷰/요약] VGA: Vision GUI Assistant - Minimizing Hallucinations through Image-Centric Fine-Tuning

VGA: Vision GUI Assistant -- Minimizing Hallucinations through Image-Centric Fine-TuningRecent advances in Large Vision-Language Models (LVLMs) have significantly improve performance in image comprehension tasks, such as formatted charts and rich-content images. Yet, Graphical User Interface (GUI) pose a greater challenge due to their structuarxiv.orgAbstract기존 VLM은 시각적 입력을 무시하고 텍스트에 과도하게 의존하는 경..

Paper 2024. 11. 21. 14:12

이전 Prev 1 Next 다음

목록vision language model (1)

My Vision, Computer Vision

티스토리툴바