일 | 월 | 화 | 수 | 목 | 금 | 토 |
---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | ||
6 | 7 | 8 | 9 | 10 | 11 | 12 |
13 | 14 | 15 | 16 | 17 | 18 | 19 |
20 | 21 | 22 | 23 | 24 | 25 | 26 |
27 | 28 | 29 | 30 |
- object detection
- gres: generalized referring expression segmentation 논문 리뷰
- clip adapter
- 논문 리뷰
- referring expression segmentation
- grefcoco dataset
- 1차 미분 마스크
- 딥러닝 엔트로피
- 딥러닝 목적함수
- gres
- blip-2
- vlm
- 에지 검출
- vlm hallucination paper
- Object detection article
- grefcoco
- 논문 요약
- gres 논문
- polling-based object probing evaluation
- gres 논문 리뷰
- mobilenetv1
- gres: generalized referring expression segmentation 논문
- vlm 환각이란
- gres: generalized referring expression segmentation
- 이미지 필터링
- 객체 검출
- clip
- 기계학습
- 원격 학습 안끊기게
- 엔트로피란
- Today
- Total
목록Paper (42)
My Vision, Computer Vision

LoRA: Low-Rank Adaptation of Large Language ModelsAn important paradigm of natural language processing consists of large-scale pre-training on general domain data and adaptation to particular tasks or domains. As we pre-train larger models, full fine-tuning, which retrains all model parameters, becomes learxiv.orgJournal : ICLR 2022Published Date : 2021년 6월 17일Keyword : LLM, RANK Abstract모델의 크기가..

Data-Efficient Multimodal Fusion on a Single GPUThe goal of multimodal alignment is to learn a single latent space that is shared between multimodal inputs. The most powerful models in this space have been trained using massive datasets of paired inputs and large-scale computational resources, making tharxiv.orgJournal: CVPR 20204Published Date: 2023년 12월 15일Keyword: Single GPU, Vision Language ..

TinyLLaVA: A Framework of Small-scale Large Multimodal ModelsWe present the TinyLLaVA framework that provides a unified perspective in designing and analyzing the small-scale Large Multimodal Models (LMMs). We empirically study the effects of different vision encoders, connection modules, language models, training darxiv.orgJournal: ArxivPublished Date: 2024년 2월 22일본 논문은 TinyLLaVA 프레임워크를 소개한다.또한..

SPICE: Semantic Propositional Image Caption EvaluationThere is considerable interest in the task of automatically generating image captions. However, evaluation is challenging. Existing automatic evaluation metrics are primarily sensitive to n-gram overlap, which is neither necessary nor sufficient for the taarxiv.orgJournal : ECCV 2016Published Date : 2016년 9월 16일keyword : Evaluation Metric, SP..

CIDEr: Consensus-based Image Description EvaluationJournal : CVPR 2015Published Date : 2014년 11월 20일Keyword : CIDEr score, Evaluation Metric, Microsoft CIDEr: Consensus-based Image Description EvaluationAutomatically describing an image with a sentence is a long-standing challenge in computer vision and natural language processing. Due to recent progress in object detection, attribute classifica..

BLEU | Proceedings of the 40th Annual Meeting on Association for Computational LinguisticsWe present the results of an experiment on extending the automatic method of Machine Translation evaluation BLUE with statistical weights for lexical items, such as tf.idf scores. We show that this extension gives additional information about evaluated ...dl.acm.org Published Date : 2002년 7월 1일keyword : BLE..