Hong Yong Man

A Simple Framework for Contrastive Learning of Visual Representations

Contrastive Learning

Source: <https://ankeshanand.com/blog/2020/01/26/contrative-self-supervised-learning.html>

Two images are fed in as input, and the model learns whether they are similar or not.
There are two ways to label the input image pairs: (1) manually, or (2) automatically, using augmentation.
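
The automatic labeling route can be sketched in a few lines. This is a minimal illustration, not code from the cited post: `augment` and `make_pairs` are hypothetical helpers, images are tiny nested lists of grayscale values, and the toy augmentation (random crop plus brightness jitter) is an assumption chosen for brevity.

```python
import random

def augment(image, rng, crop=3):
    """Hypothetical toy augmentation: random crop plus brightness jitter."""
    h, w = len(image), len(image[0])
    top = rng.randint(0, h - crop)
    left = rng.randint(0, w - crop)
    scale = rng.uniform(0.8, 1.2)
    return [[min(1.0, px * scale) for px in row[left:left + crop]]
            for row in image[top:top + crop]]

def make_pairs(images, rng):
    """Return (view_a, view_b, label) triples; label 1 means same source image."""
    pairs = []
    for i, img in enumerate(images):
        # Positive pair: two independent augmentations of the same image.
        pairs.append((augment(img, rng), augment(img, rng), 1))
        # Negative pair: augmentations of two different images.
        j = rng.choice([k for k in range(len(images)) if k != i])
        pairs.append((augment(img, rng), augment(images[j], rng), 0))
    return pairs

rng = random.Random(0)
# Three random 4x4 "images" with pixel values in [0, 1).
images = [[[rng.random() for _ in range(4)] for _ in range(4)] for _ in range(3)]
pairs = make_pairs(images, rng)
```

No human annotation is involved: the label comes entirely from whether the two views were derived from the same source image.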

SimCLR?

A form of unsupervised (self-supervised) learning based on contrastive learning. It approaches the top-1 accuracy of supervised learning.
To label the input images, three augmentations are used: random crop, color distortion, and Gaussian blur.
- The augmentations were selected experimentally; more complex augmentations such as AutoAugment actually hurt performance.
ResNet-50 is used as the encoder, and the output of its GAP (global average pooling) layer is used (2048 dimensions).
This is followed by a 2-layer MLP (Multi-Layer Perceptron) that projects into a 128-dimensional latent space.
- Reference on latent space: https://dev-hani.tistory.com/entry/Latent-space-%EA%B0%84%EB%8B%A8-%EC%A0%95%EB%A6%AC
A cosine similarity function is used to check whether two input images are similar.
The normalized temperature-scaled cross entropy (NT-Xent) loss is used.
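
To make the last two points concrete, here is a rough sketch of cosine similarity and the NT-Xent loss in plain Python. It is an illustration under assumptions, not the reference implementation: the function names are made up for this example, embeddings are assumed to be ordered so that `z[2k]` and `z[2k+1]` are the two views of image `k`, and the temperature of 0.5 is an arbitrary choice.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def nt_xent_loss(z, temperature=0.5):
    """NT-Xent over 2N embeddings; z[2k] and z[2k+1] are views of image k (assumed layout)."""
    n = len(z)
    total = 0.0
    for i in range(n):
        j = i ^ 1  # index of the positive partner: 0<->1, 2<->3, ...
        # Temperature-scaled similarities to every sample except i itself.
        logits = {k: cosine_similarity(z[i], z[k]) / temperature
                  for k in range(n) if k != i}
        log_denominator = math.log(sum(math.exp(v) for v in logits.values()))
        # Cross entropy with the positive partner as the "correct class".
        total += log_denominator - logits[j]
    return total / n

# Views of the same image point the same way: low loss.
aligned = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
# Views of the same image are orthogonal: high loss.
misaligned = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
loss_aligned = nt_xent_loss(aligned)
loss_misaligned = nt_xent_loss(misaligned)
```

The loss falls as each embedding moves closer to its positive partner and away from every other sample in the batch, which is what pushes augmented views of the same image together in the latent space.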

Train

The network is built as described above and then trained.

How to Use?

Source: <https://amitness.com/2020/03/illustrated-simclr/>

Attach a layer to the output of the GAP layer and fine-tune.
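
As a simplified sketch of this step, the example below trains a single logistic layer on top of fixed feature vectors. Note the assumptions: the encoder is treated as frozen (closer to linear evaluation than to full fine-tuning), the 3-dimensional `features` are made-up stand-ins for the 2048-dimensional GAP output, and `train_linear_head` and `predict` are hypothetical names.

```python
import math

def train_linear_head(features, labels, lr=0.5, epochs=200):
    """Fit a logistic-regression head with plain SGD on fixed encoder features."""
    w = [0.0] * len(features[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(features, labels):
            score = sum(wi * xi for wi, xi in zip(w, x)) + b
            p = 1.0 / (1.0 + math.exp(-score))
            grad = p - y  # derivative of the log loss w.r.t. the score
            w = [wi - lr * grad * xi for wi, xi in zip(w, x)]
            b -= lr * grad
    return w, b

def predict(w, b, x):
    """Return class 1 if the linear score is positive, else class 0."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b > 0 else 0

# Made-up stand-ins for GAP-layer features of four labeled images.
features = [[1.0, 0.1, 0.0], [0.9, 0.0, 0.2], [0.0, 1.0, 0.1], [0.1, 0.9, 0.0]]
labels = [1, 1, 0, 0]
w, b = train_linear_head(features, labels)
predictions = [predict(w, b, x) for x in features]
```

In a real setup the new layer would sit on the pretrained ResNet-50 and the encoder weights could also be updated during fine-tuning; only the head is trained here to keep the example small.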
