Transformer의 디코더 부분을 활용한 pre-training 모델 Bert가 19년도에 등장했다!
BERT / Pre-training of Deep Bidrectional Transformer (2019)
This post is licensed under CC BY 4.0 by the author.
Transformer의 디코더 부분을 활용한 pre-training 모델 Bert가 19년도에 등장했다!
A new version of content is available.