My Vision, Computer Vision
Category: Paper (51)
SPICE: Semantic Propositional Image Caption Evaluation
Journal: ECCV 2016 | Published: September 16, 2016 | Keywords: Evaluation Metric, SP…
"There is considerable interest in the task of automatically generating image captions. However, evaluation is challenging. Existing automatic evaluation metrics are primarily sensitive to n-gram overlap, which is neither necessary nor sufficient for the ta…" (arxiv.org)
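The abstract's "neither necessary nor sufficient" claim is easy to demonstrate concretely. A minimal sketch (the toy captions and whitespace tokenization are my own, purely for illustration): a correct paraphrase can have zero bigram overlap with the reference, while a scrambled sentence keeps most of it.

```python
from collections import Counter

def bigram_overlap(cand: str, ref: str) -> float:
    """Fraction of candidate bigrams that also occur in the reference (clipped)."""
    grams = lambda s: Counter(zip(s.split(), s.split()[1:]))
    c, r = grams(cand), grams(ref)
    return sum(min(v, r[g]) for g, v in c.items()) / max(sum(c.values()), 1)

ref = "a young girl standing on top of a tennis court"
# Correct paraphrase, zero overlap: n-gram overlap is not *necessary*.
print(bigram_overlap("a child plays tennis", ref))                            # 0.0
# Garbled word order, high overlap: n-gram overlap is not *sufficient*.
print(bigram_overlap("standing on top a young girl of a tennis court", ref))  # ~0.78
```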
CIDEr: Consensus-based Image Description Evaluation
Journal: CVPR 2015 | Published: November 20, 2014 | Keywords: CIDEr score, Evaluation Metric, Microsoft
"Automatically describing an image with a sentence is a long-standing challenge in computer vision and natural language processing. Due to recent progress in object detection, attribute classifica…" (arxiv.org)
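CIDEr's core is to represent each caption as a tf-idf-weighted n-gram vector and average its cosine similarity against every reference caption (over n = 1…4 in the paper). A simplified single-n sketch; the toy corpus, function names, and the omission of stemming and count clipping are my simplifications, not the reference implementation:

```python
import math
from collections import Counter

def ngram_counts(tokens, n):
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def tfidf(tokens, n, doc_freq, num_images):
    """TF of each n-gram in the caption, down-weighted by how many images'
    reference sets contain it: rare n-grams carry more consensus signal."""
    counts = ngram_counts(tokens, n)
    total = sum(counts.values())
    return {g: (c / total) * math.log(num_images / doc_freq.get(g, 1))
            for g, c in counts.items()}

def cosine(a, b):
    dot = sum(v * b.get(g, 0.0) for g, v in a.items())
    na, nb = (math.sqrt(sum(v * v for v in d.values())) for d in (a, b))
    return dot / (na * nb) if na and nb else 0.0

def cider_n(cand, refs, n, doc_freq, num_images):
    """CIDEr_n: mean cosine similarity between candidate and each reference."""
    cv = tfidf(cand.split(), n, doc_freq, num_images)
    return sum(cosine(cv, tfidf(r.split(), n, doc_freq, num_images))
               for r in refs) / len(refs)

refs = ["a man is riding a horse", "a person rides a brown horse"]
df = Counter()                      # toy document frequencies (one image only)
for r in refs:
    df.update(set(ngram_counts(r.split(), 1)))
print(cider_n("a man rides a horse", refs, 1, df, num_images=100))
```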
BLEU (Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics)
Published: July 1, 2002 | Keywords: BLE…
"We present the results of an experiment on extending the automatic method of Machine Translation evaluation BLEU with statistical weights for lexical items, such as tf.idf scores. We show that this extension gives additional information about evaluated …" (dl.acm.org)
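The original BLEU that this snippet builds on is the geometric mean of clipped n-gram precisions multiplied by a brevity penalty. A minimal single-reference, sentence-level sketch, assuming whitespace tokenization and no smoothing (production implementations add both, plus multi-reference support):

```python
import math
from collections import Counter

def ngrams(tokens, n):
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def bleu(candidate: str, reference: str, max_n: int = 4) -> float:
    """Sentence BLEU: geometric mean of clipped n-gram precisions x brevity penalty."""
    cand, ref = candidate.split(), reference.split()
    log_prec = []
    for n in range(1, max_n + 1):
        c_counts, r_counts = Counter(ngrams(cand, n)), Counter(ngrams(ref, n))
        # Clip each candidate n-gram by its reference count so repeating
        # a matching word cannot inflate precision.
        clipped = sum(min(c, r_counts[g]) for g, c in c_counts.items())
        log_prec.append(math.log(max(clipped, 1e-9) / max(sum(c_counts.values()), 1)))
    # Brevity penalty punishes candidates shorter than the reference.
    bp = 1.0 if len(cand) >= len(ref) else math.exp(1 - len(ref) / max(len(cand), 1))
    return bp * math.exp(sum(log_prec) / max_n)

print(bleu("the cat sat on the mat", "the cat is on the mat", max_n=2))  # ~0.71
print(bleu("the cat sat on the mat", "the cat is on the mat"))          # ~0: no 4-gram match
```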
Learning to Prompt for Vision-Language Models
Journal: Springer 2022 | Published: September 2, 2021
"Large pre-trained vision-language models like CLIP have shown great potential in learning representations that are transferable across a wide range of downstream tasks. Different from the traditional representation learning that is based mostly on discreti…" (arxiv.org)
Problem: Existing VLMs such as CLIP rely on prompt engineering for zero-shot t…
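CoOp's answer to hand-tuned prompt engineering is to learn the prompt's context tokens as continuous vectors while keeping CLIP frozen. A minimal PyTorch sketch of that idea; the dimensions, initialization scale, and the way class-name embeddings are passed in are illustrative assumptions, not the paper's exact code:

```python
import torch
import torch.nn as nn

class PromptLearner(nn.Module):
    """CoOp-style learnable prompt: M shared context vectors are optimized
    by backprop while the CLIP encoders stay frozen."""
    def __init__(self, class_embeddings: torch.Tensor, n_ctx: int = 16):
        super().__init__()
        dim = class_embeddings.size(-1)
        # Learnable "context words" replacing a template like "a photo of a {class}".
        self.ctx = nn.Parameter(0.02 * torch.randn(n_ctx, dim))
        # Frozen token embeddings of the class names, shape (K, dim).
        self.register_buffer("cls_emb", class_embeddings)

    def forward(self) -> torch.Tensor:
        k = self.cls_emb.size(0)
        ctx = self.ctx.unsqueeze(0).expand(k, -1, -1)  # (K, n_ctx, dim)
        cls = self.cls_emb.unsqueeze(1)                # (K, 1, dim)
        # One prompt per class, fed to the frozen text encoder downstream.
        return torch.cat([ctx, cls], dim=1)            # (K, n_ctx + 1, dim)

prompts = PromptLearner(torch.randn(10, 512))()        # 10 classes, 512-d tokens
print(prompts.shape)                                   # torch.Size([10, 17, 512])
```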
LLaMA: Open and Efficient Foundation Language Models
Published: February 27, 2023 | Meta AI
"We introduce LLaMA, a collection of foundation language models ranging from 7B to 65B parameters. We train our models on trillions of tokens, and show that it is possible to train state-of-the-art models using publicly available datasets exclusively, witho…" (arxiv.org)
Problem: Recent work has studied how to jointly optimize LLM and dataset size under a limited compute budget…
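The compute-budget line refers to compute-optimal scaling (Chinchilla's roughly 20 tokens per parameter); LLaMA instead trains comparatively small models well past that point so they are cheaper to serve at inference. A back-of-envelope sketch using the standard C ≈ 6·N·D FLOPs approximation (the token count is LLaMA-7B's reported figure; the comparison itself is my illustration):

```python
# Back-of-envelope training-compute arithmetic, using the standard
# approximation C ≈ 6·N·D FLOPs (N = parameters, D = training tokens).
def train_flops(n_params: float, n_tokens: float) -> float:
    return 6 * n_params * n_tokens

n = 7e9                                   # LLaMA-7B
chinchilla = train_flops(n, 20 * n)       # ~20 tokens/param "compute-optimal" point
llama = train_flops(n, 1.0e12)            # LLaMA-7B's reported ~1T training tokens

# LLaMA spends ~7x the "optimal" training compute on a small model,
# trading training cost for cheaper inference.
print(f"compute-optimal: {chinchilla:.2e} FLOPs, LLaMA-7B: {llama:.2e} FLOPs")
```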
BEiT: BERT Pre-Training of Image Transformers
"We introduce a self-supervised vision representation model BEiT, which stands for Bidirectional Encoder representation from Image Transformers. Following BERT developed in the natural language processing area, we propose a masked image modeling task to pre…" (arxiv.org)
Problem: Proposes BEiT (Bidirectional Encoder representation from Image Transformers). ViT … CNN…
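BEiT's masked image modeling mirrors BERT: mask a subset of image patches and train the encoder to predict the discrete visual token (from a frozen dVAE tokenizer) at each masked position. A minimal PyTorch sketch; the layer sizes, masking ratio, and the random stand-ins for patch embeddings and tokenizer outputs are illustrative assumptions:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Illustrative sizes, not the paper's exact configuration.
num_patches, dim, vocab = 196, 768, 8192   # 14x14 patches, dVAE codebook size

encoder = nn.TransformerEncoder(            # stand-in for the ViT backbone
    nn.TransformerEncoderLayer(d_model=dim, nhead=12, batch_first=True),
    num_layers=2)
to_logits = nn.Linear(dim, vocab)           # predicts a visual token per patch
mask_token = nn.Parameter(torch.zeros(1, 1, dim))

patches = torch.randn(4, num_patches, dim)                  # patch embeddings
visual_tokens = torch.randint(0, vocab, (4, num_patches))   # frozen dVAE output
mask = torch.rand(4, num_patches) < 0.4                     # positions to corrupt

# Replace masked patch embeddings with the shared [MASK] embedding.
x = torch.where(mask.unsqueeze(-1), mask_token.expand_as(patches), patches)
logits = to_logits(encoder(x))

# As in BERT, the loss is computed only at the masked positions.
loss = F.cross_entropy(logits[mask], visual_tokens[mask])
loss.backward()
```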