My Vision, Computer Vision

gyuilLim

Posts tagged gsva (1)

[Paper Summary/Review] GSVA: Generalized Segmentation via Multimodal Large Language Models

Generalized Referring Expression Segmentation (GRES) extends the scope of classic RES to refer to multiple objects in one expression or identify the empty targets absent in the image. GRES poses challenges in modeling the complex spatial relationships of t.. (arxiv.org)
Author: Xia, Zhuofan, et al.
Journal: CVPR 2024
Published Date: 202..

Paper 2025. 6. 18. 13:44
