'blip-2' 태그의 글 목록

Notice

Recent Posts

Recent Comments

Link

« 2026/04 »
일	월	화	수	목	금	토
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30

Tags more

Archives

Today

Total

관리 메뉴

목록blip-2 (3)

My Vision, Computer Vision

[논문 리뷰/요약] BLIP-2: Bootstrapping Language-Image Pre-training

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language ModelsThe cost of vision-and-language pre-training has become increasingly prohibitive due to end-to-end training of large-scale models. This paper proposes BLIP-2, a generic and efficient pre-training strategy that bootstraps vision-language pre-training from oarxiv.org Problem대부분의 State-of-the-art V..

Paper 2025. 2. 7. 18:35

[졸업 논문/프로젝트] 딥러닝 알고리즘을 이용한 상황 검색 가능한 비디오 플레이어 구현

Youtube 영상 장면 검색(Colab)https://github.com/gyuilLim/youtube-scene-search-with-text GitHub - gyuilLim/youtube-scene-search-with-text: Finding scenes that you want by text automaticallyFinding scenes that you want by text automatically - gyuilLim/youtube-scene-search-with-textgithub.com 장면 검색 비디오 플레이어https://github.com/gyuilLim/Scene-search-video-player GitHub - gyuilLim/Scene-search-video-player: ..

WorkPlace 2024. 12. 4. 20:13

이미지에서 원하는 텍스트 뽑아내기, 이미지 캡셔닝 BLIP-2(Colab 가능)

LAVIS/projects/blip2 at main · salesforce/LAVIS LAVIS - A One-stop Library for Language-Vision Intelligence - salesforce/LAVIS github.com 이 글은 위 Github을 참고하여 작성했습니다. 본 글에서는 이미지 캡셔닝(Image2 Text) BLIP-2 모델 사용법을 알려드리려고 합니다. BLIP-2은 입력 이미지에 대한 Text를 출력해주는데, 사용자가 원하는 형태의 답변을 지정해 줄 수 있습니다. Google Colab T4(15GB) 환경에서 실행가능하며, 약 12GB 정도의 GPU memory를 사용합니다. 1. Install BLIP-2 패키지를 설치해줍니다. BLIP-2은 salesforc..

WorkPlace 2024. 3. 28. 15:40

이전 Prev 1 Next 다음

목록blip-2 (3)

My Vision, Computer Vision

티스토리툴바