'Paper' 카테고리의 글 목록 (3 Page)

Notice

Recent Posts

Recent Comments

Link

« 2025/10 »
일	월	화	수	목	금	토
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31

Tags more

Archives

Today

Total

관리 메뉴

목록Paper (50)

My Vision, Computer Vision

[논문 요약/리뷰] DoRA: Weight-Decomposed Low-Rank Adaptation

DoRA: Weight-Decomposed Low-Rank AdaptationAmong the widely used parameter-efficient fine-tuning (PEFT) methods, LoRA and its variants have gained considerable popularity because of avoiding additional inference costs. However, there still often exists an accuracy gap between these methods and fullarxiv.org Author : Liu, Shih-Yang, et al.Journal : ICML 2024Keyword : DoRAPublished Date : 2024년 2월..

Paper 2025. 3. 18. 16:46

[논문 요약/리뷰] Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate ShiftTraining Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates and careful paramarxiv.org Author : Ioffe, Sergey, and Christian Sz..

Paper 2025. 3. 17. 20:14

[논문 요약/리뷰] LORA: LOW-RANK ADAPTATION OF LARGE LANGUAGE MODELS

LoRA: Low-Rank Adaptation of Large Language ModelsAn important paradigm of natural language processing consists of large-scale pre-training on general domain data and adaptation to particular tasks or domains. As we pre-train larger models, full fine-tuning, which retrains all model parameters, becomes learxiv.orgJournal : ICLR 2022Published Date : 2021년 6월 17일Keyword : LLM, RANK Abstract모델의 크기가..

Paper 2025. 3. 11. 19:54

[논문 요약/리뷰] FuseMix: Data-Efficient Multimodal Fusion on a Single GPU

Data-Efficient Multimodal Fusion on a Single GPUThe goal of multimodal alignment is to learn a single latent space that is shared between multimodal inputs. The most powerful models in this space have been trained using massive datasets of paired inputs and large-scale computational resources, making tharxiv.orgJournal: CVPR 20204Published Date: 2023년 12월 15일Keyword: Single GPU, Vision Language ..

Paper 2025. 3. 7. 16:41

[논문 리뷰/요약] TinyLLaVA: A Framework of Small-scale Large Multimodal Models

TinyLLaVA: A Framework of Small-scale Large Multimodal ModelsWe present the TinyLLaVA framework that provides a unified perspective in designing and analyzing the small-scale Large Multimodal Models (LMMs). We empirically study the effects of different vision encoders, connection modules, language models, training darxiv.orgJournal: ArxivPublished Date: 2024년 2월 22일본 논문은 TinyLLaVA 프레임워크를 소개한다.또한..

Paper 2025. 3. 4. 19:12

[논문 요약/리뷰] SPICE: Semantic Propositional Image Caption Evaluation

SPICE: Semantic Propositional Image Caption EvaluationThere is considerable interest in the task of automatically generating image captions. However, evaluation is challenging. Existing automatic evaluation metrics are primarily sensitive to n-gram overlap, which is neither necessary nor sufficient for the taarxiv.orgJournal : ECCV 2016Published Date : 2016년 9월 16일keyword : Evaluation Metric, SP..

Paper 2025. 2. 28. 13:44

이전 Prev 1 2 3 4 5 6 ··· 9 Next 다음

목록Paper (50)

My Vision, Computer Vision

티스토리툴바