[NLP] gensim Word2Vec을 이용한 embedding vector train
gensim library를 활용하여 embedding vetor를 train 해보자. Data load from gensim.models import Word2Vec, KeyedVectors from torchtext import data, datasets train_iter = datasets.IMDB(split='train') train_text = [] for label, text in train_iter: train_text.append(text.lower().replace(' ','')) torchtext dataset에 있는 IMDB 데이터를 활용해 model을 train했다. Train - sg가 1이면 skip-gram, 0이면 CBOW model = Word2Vec(sentences=t..
NLP
2023. 3. 26. 19:21