'learned positional embedding' 태그의 글 목록

현대 NLP 모델의 근간이 되는 BERT의 기본적인 특징

1. pre-trained model은 왜 의미있을까? pre-training과정에서 수행한 up-stream task의 data는 별도의 label이 필요하지 않은 데이터라는 것이 하나의 강점이다. ------------------------------------------------------------------------------------------------------------------------------- 다음 단어를 맞추는 것이 label이 없다고? GPT-1이 수행한 다음 단어를 예측하는 pre-training task는 input sequence와 output sequence가 동일한 task이다. 쉽게 말해 input sequence를 차례대로 읽어들여 input sequenc..

format_list_bulleted 딥러닝/NLP
· 2022. 10. 24.
textsms

navigate_before
1
navigate_next

현대 NLP 모델의 근간이 되는 BERT의 기본적인 특징

티스토리툴바