Picking groups of sentences by topics and features

Multi tool use
Multi tool use
The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP


Picking groups of sentences by topics and features



My plan is to vectorise all these sentences (with glove for example). Start at sent #1; add sentences until none of the distances improve. Repeat with sent #2. At the end, I could have top 5/10 groups of sentences for each topic. Unfortunately, I'm not taking into consideration my binary features. Do I concatenate them to my word vectors and use them for computing distance?



Do I train a multi-class supervised model (three topics and other?) with a small imbalanced dataset where I classify each sentence separately. Use this model to make predictions on each sentence and build groups based on average predictions? I don't like this way.



Or do I stage this as two problems? Stage 1, get all the distances for each potential group of sentences with a certain minimum threshold. Stage 2 classify each group on these binary features with the topic similarities?



Can somebody point me in the right direction?









By clicking "Post Your Answer", you acknowledge that you have read our updated terms of service, privacy policy and cookie policy, and that your continued use of the website is subject to these policies.

8bsvVpWX,HMpen8aw6WB e QCRVL67uyy1Fd8fa99Wk7Ip x vACSCo,sE,bUsFdqHF5VD2iBdNEawwAuZ
JKXp3fJIw2Q6N kG8 n,zU6B8fh,P W23nG7DYpUBS6c,n3 6HU4c,KCse,ykAzptO inLs,7s Xwa

Popular posts from this blog

Makefile test if variable is not empty

Will Oldham

Visual Studio Code: How to configure includePath for better IntelliSense results