My Vision, Computer Vision


All posts (84)


ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Increasing model size when pretraining natural language representations often results in improved performance on downstream tasks. However, at some point further model increases become harder due to GPU/TPU memory limitations and longer training times. To... (arxiv.org)
Author : Lan, Zhenzhong, et al.
Journal : ICLR 2020
Keyword ..

Paper 2025. 5. 16. 10:34

Google Summer of Code (GSoC) Acceptance Review

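GSoC (Google Summer of Code) 2025
Google Summer of Code is an open-source program that runs over the summer. Various overseas companies bring projects, one student takes on each project, and mentors from that company help out and give feedback. The Organizations List is grouped by field, such as AI, Security, and Web, and the AI field even includes DeepMind. I applied to Intel's OpenVINO (you can apply to up to three projects, but I applied to only one), a toolkit that makes deep learning models easy to use. The process from first contact to application differs by company and by project mentor, so I am writing this review based on my own experience.
Project announcement and contact (2/27 ~ 3/24)
For GSoC 2025, on February 27 the projects for each company ..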

WorkPlace 2025. 5. 9. 19:59

[Paper Summary/Review] Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation

Open-Vocabulary Segmentation (OVS) aims at segmenting images from free-form textual concepts without predefined training classes. While existing vision-language models such as CLIP can generate segmentation masks by leveraging coarse spatial information fr... (arxiv.org)
Author : Barsellotti, Luca, ..

Paper 2025. 5. 2. 19:43

[Paper Summary/Review] LAVT: Language-Aware Vision Transformer for Referring Image Segmentation

Referring image segmentation is a fundamental vision-language task that aims to segment out an object referred to by a natural language expression from an image. One of the key challenges behind this task is leveraging the referring expression for highligh... (arxiv.org)
Author : Yang, Zhao, et al.
Journal : CVPR 2022
Keyword : PWAM, ..

Paper 2025. 5. 2. 18:05

[Paper Summary/Review] A Survey on Hallucination in Large Vision-Language Models

Recent development of Large Vision-Language Models (LVLMs) has attracted growing attention within the AI landscape for its practical implementation potential. However, "hallucination", or more specifically, the misalignment between factual visual content... (arxiv.org)
Author : Liu, Hanchao, et al.
Journal : arXiv
Keyword : Survey, Vision Langau..

Paper 2025. 4. 24. 18:26

[Paper Review/Summary] GRES: Generalized Referring Expression Segmentation

Referring Expression Segmentation (RES) aims to generate a segmentation mask for the object described by a given language expression. Existing classic RES datasets and methods commonly support single-target expressions only, i.e., one expression refers to... (arxiv.org)
Author : Liu, Chang, Henghui Ding, and Xudong Jiang.
Journal : CVPR 2023
Keyword : Re..

Paper 2025. 4. 16. 12:38
