<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
<channel>
<title>Ask Ghassem - Recent questions tagged sentiment-analysis</title>
<link>https://ask.ghassem.com/tag/sentiment-analysis</link>
<description>Powered by Question2Answer</description>
<item>
<title>Very short text classification when category text should be replaced by another category text?</title>
<link>https://ask.ghassem.com/980/classification-category-should-replaced-another-category</link>
<description>&lt;div style=&quot;max-width:800px&quot;&gt;
&lt;div style=&quot;color:#1A1A1B&quot;&gt;
&lt;p&gt;I need some tool to classify articles based on short category text which consists of two or three words separated by &#039;-&#039;. The RSS/XML tag content is for example:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Foreign - News&lt;/p&gt;
&lt;/blockquote&gt;

&lt;blockquote&gt;
&lt;p&gt;Football - Foreign&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;I created my own categories in DB and now I need to classify categories from parsed RSS of this news source, so it fits news categories defined by me.&lt;/p&gt;

&lt;p&gt;I would, for example need all articles containing category &quot;football&quot; to be identified as a category &lt;em&gt;Sport&lt;/em&gt; but sometimes those categories XML tags contains exact match like &lt;em&gt;Foreign - News&lt;/em&gt; should belong in the DB to category defined by me as &lt;em&gt;Foreign&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;Since I used only trained decision trees frameworks from AI so for another project so far, I would like to hear advice about probably AI based approach, technique or particular framework I can use to solve this problem. I don&#039;t want to get into a dead-end street by my own poor, in the field of AI not very experienced decision.&lt;/p&gt;

&lt;p&gt;While it can be solved by many ifs and &#039;contains&#039; function, it seems to me like not a very good solution.&lt;/p&gt;

&lt;p&gt;TLDR; I need basically something like &quot;clever, flexible and universal if-elseif&quot;.&lt;/p&gt;

&lt;p&gt;NOTE: I can also use article description text, if that would be necessary but it seems to me that this former category text is &amp;nbsp;unambiguous enough for this kind of problem.&lt;/p&gt;
&lt;/div&gt;
&lt;/div&gt;</description>
<category>Artificial Intelligence</category>
<guid isPermaLink="true">https://ask.ghassem.com/980/classification-category-should-replaced-another-category</guid>
<pubDate>Thu, 11 Feb 2021 12:48:47 +0000</pubDate>
</item>
<item>
<title>Binary Classification and neutral tag</title>
<link>https://ask.ghassem.com/978/binary-classification-and-neutral-tag</link>
<description>&lt;p&gt;I am trying to create a sentiment analysis model using binary classification as loss.I have a batch of tweets that some of them are tagged as positive (labeled as 1)&amp;nbsp;and&amp;nbsp;negative (labeled as 0).I manage to gather some tweets that are tagged as neutral but there are less&amp;nbsp; tweets than positive and negative.My thinking is to tag them with 0.5 to balance the classification probability.Is this legit?&lt;/p&gt;

&lt;div id=&quot;gtx-trans&quot; style=&quot;position: absolute; left: 460px; top: 54px;&quot;&gt;
&lt;div class=&quot;gtx-trans-icon&quot;&gt;&amp;nbsp;&lt;/div&gt;
&lt;/div&gt;</description>
<category>Deep Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/978/binary-classification-and-neutral-tag</guid>
<pubDate>Sat, 30 Jan 2021 10:08:01 +0000</pubDate>
</item>
<item>
<title>&quot;Rare words&quot; on vocabulary</title>
<link>https://ask.ghassem.com/977/rare-words-on-vocabulary</link>
<description>I am trying to create a sentiment analysis model and I have a question.&lt;br /&gt;
&lt;br /&gt;
After I preprocessed my tweets and created my vocabulary I&amp;#039;ve noticed that I have words that appear less than 5 times in my dataset (Also there are many of them that appear 1 time). Many of them are real words and not gibberish. My thinking is that if I keep those words then they will get wrong &amp;quot;sentimental&amp;quot; weights and gonna make my model worse.&lt;br /&gt;
Is my thinking right or am I missing something?&lt;br /&gt;
&lt;br /&gt;
My vocab size is around 40000 words and those that are &amp;quot;rare&amp;quot; are around 10k.Should I &amp;quot;sacrifice&amp;quot; them?&lt;br /&gt;
&lt;br /&gt;
Thanks in advance.</description>
<category>Deep Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/977/rare-words-on-vocabulary</guid>
<pubDate>Sat, 30 Jan 2021 09:57:31 +0000</pubDate>
</item>
<item>
<title>My GloVe word embeddings contain sentiment?</title>
<link>https://ask.ghassem.com/972/my-glove-word-embeddings-contain-sentiment</link>
<description>&lt;p&gt;I&#039;ve been researching sentiment analysis with word embeddings. I read papers that state that word embeddings ignore sentiment information of the words in the text. One paper states that among the top 10 words that are semantically similar, around 30 percent of words have opposite polarity e.g. happy - sad.&lt;/p&gt;

&lt;p&gt;So, I computed word embeddings on my dataset (Amazon reviews) with the GloVe algorithm in R. Then, I looked at the most similar words with cosine similarity and I found that actually every word is sentimentally similar. (E.g. beautiful - lovely - gorgeous - pretty - nice - love). Therefore, I was wondering how this is possible since I expected the opposite from reading several papers. What could be the reason for my findings?&lt;/p&gt;

&lt;p&gt;Two of the many papers I read:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Yu, L. C., Wang, J., Lai, K. R. &amp;amp; Zhang, X. (2017). Refining Word Embeddings Using Intensity Scores for Sentiment Analysis. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26(3), 671-681.&lt;/li&gt;
&lt;li&gt;Tang, D., Wei, F., Yang, N., Zhou, M., Liu, T. &amp;amp; Qin, B. (2014). Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 1: Long Papers, 1555-1565&lt;/li&gt;
&lt;/ul&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/972/my-glove-word-embeddings-contain-sentiment</guid>
<pubDate>Sun, 03 Jan 2021 14:09:37 +0000</pubDate>
</item>
<item>
<title>How to perform sentiment analysis in NLP?</title>
<link>https://ask.ghassem.com/412/how-to-perform-sentiment-analysis-in-nlp</link>
<description>If trying to read text and need to finalize texts as good, bad , ugly or any such buckets, where to start? What sentiment functions to use?</description>
<category>Machine Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/412/how-to-perform-sentiment-analysis-in-nlp</guid>
<pubDate>Wed, 17 Oct 2018 00:45:12 +0000</pubDate>
</item>
</channel>
</rss>