Notice
Recent Posts
Recent Comments
Link
| 일 | 월 | 화 | 수 | 목 | 금 | 토 |
|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 |
| 8 | 9 | 10 | 11 | 12 | 13 | 14 |
| 15 | 16 | 17 | 18 | 19 | 20 | 21 |
| 22 | 23 | 24 | 25 | 26 | 27 | 28 |
Tags
- 원격 학습 안끊기게
- gres
- grefcoco dataset
- 딥러닝 목적함수
- clip
- object detection
- 논문 요약
- gsoc 2025
- transfuser++
- 논문 리뷰
- gsoc
- mobilenetv1
- grefcoco
- referring expression segmentation
- E2E 자율주행
- gsoc 후기
- 엔트로피란
- Object detection article
- 이미지 필터링
- res paper
- 에지 검출
- blip-2
- 1차 미분 마스크
- vlm
- 객체 검출
- res
- google summer of code
- clip adapter
- 딥러닝 엔트로피
- TransFuser
Archives
- Today
- Total
목록vit code (1)
My Vision, Computer Vision
An Image is Worth 16x16 Words: Transformers for Image Recognition at ScaleWhile the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in conjunction with convolutional networks, or used to reparxiv.orgAbstractTransformer가 사실상 NLP 분야의 표준이 되었지만 Computer vision에 ..
Paper
2024. 8. 27. 19:38