I am indexing a corpus of documents (news articles,forum posts etc.) into an Elasticsearch. To provide better search, I have also trained a SVM+Tf-Idf model classifying document to generate tags into a taxonomy e.g. News- Politics, News-Sports,Post-US Politics etc. My question: how do I weight the scores generated by the classifier to writing the document into ES?
I have been using a hackish approach, for example if I get the score of 0.7 for News-Sports, I write the ["News-Sports"] * int(score*10) i.e. write News-Sports as 7 terms into the tags field of the document.
Are there better ways of doing index-time weighting?