New York Tech Media
  • News
  • FinTech
  • AI & Robotics
  • Cybersecurity
  • Startups & Leaders
  • Venture Capital
No Result
View All Result
  • News
  • FinTech
  • AI & Robotics
  • Cybersecurity
  • Startups & Leaders
  • Venture Capital
No Result
View All Result
New York Tech Media
No Result
View All Result
Home AI & Robotics

Understanding Twitch Emotes in Sentiment Analysis

New York Tech Editorial Team by New York Tech Editorial Team
November 22, 2021
in AI & Robotics
0
Understanding Twitch Emotes in Sentiment Analysis
Share on FacebookShare on Twitter

The public’s growing use of emojis, emoticons, emotes, memes, GIFs and other non-verbal ways to communicate on social media platforms has, in recent years, increasingly confounded the efforts of data scientists to understand the global sociological landscape; at least, to the extent that worldwide sociological trends can be discerned from public discourse.

Though Natural Language Processing (NLP) has become a powerful tool in sentiment analysis over the last decade, the sector has difficulty not only in keeping up with an ever-evolving lexicon of slang and linguistic shortcuts across multiple languages, but also in attempting to decode the meaning of image-based posts on social media platforms such as Facebook and Twitter.

Since the limited number of highly populous social media platforms are the only truly hyperscale resource for this kind of research, it’s essential for the AI sector to at least attempt to maintain pace with it.

In July, a paper from Taiwan offered a new method to categorize user sentiment based on ‘reaction GIFs’ posted to social media threads (see image below), using a database of 30,000 tweets to develop a way to predict reactions to a post. The paper found that image-based responses are in many ways easier to gauge, since they are less likely to contain sarcasm, a notable challenge in sentiment analysis.

Researchers from Taiwan studied the use of animated reaction GIFs as ‘reductive indicators’ of sentiment in a 2021 paper.

Earlier this year, a research effort led by Boston University trained machine learning models to predict image memes that are likely to go viral on Twitter; and in August, British researchers examined the growth of emojis in comparison to emoticons (there’s a difference) on social media, compiling a large-scale 7-language dataset of pictographic Twitter sentiment.

Twitch Emotes

Now, US researchers have developed a machine learning methodology to better understand, categorize and measure the ever-evolving pseudo-lexicon of emotes on the hugely popular Twitch network.

Emotes are neologisms used on Twitch to express emotion, mood, or in-jokes. Since they are by definition new expressions, the challenge for a machine learning system is not necessarily to endlessly catalogue new emotes (which may only be used once, or else fall out of usage rapidly), but to gain a better understanding of the framework that endlessly generates them; and to develop systems capable of recognizing an emote as a ‘temporarily valid’ word or compound phrase whose emotional/political temperature may need to be gauged entirely from context.

Neighbors of the 'FeelsGoodMan' emote, whose meaning can be altered by obscure suffixes. Source: https://arxiv.org/pdf/2108.08411.pdf

Neighbors of the ‘FeelsGoodMan’ emote, whose meaning can be altered by obscure suffixes. Source: https://arxiv.org/pdf/2108.08411.pdf

The paper is titled FeelsGoodMan: Inferring Semantics of Twitch Neologisms, and comes from three researchers at Spiketrap, a social media analysis company in San Francisco.

Bait and Switch

Despite their novelty and often-brief lives, Twitch emotes frequently recycle cultural material (including older emotes) in a way that can steer sentiment analysis frameworks in the wrong direction. Tracing the shift in the meaning of an emote as it evolves can even reveal a complete inversion or negation of its original sentiment or intent.

For instance, the researchers note that the original alt-right misuse of the eponymous FeelsGoodMan Pepe-the-frog meme has almost completely lost its original political flavor in the context of its usage on Twitch.

The use of the phrase, together with an image of a cartoon frog from a 2005 comic by artist Matt Furie, became a far-right meme in the 2010s. Though Vox wrote in 2017 that the right’s appropriation of the meme had survived Furie’s self-avowed disassociation with such use, the San Francisco researchers behind the new paper have found otherwise*:

‘Furie’s cartoon frog was adopted by rightwing posters on various online forums like 4chan in the early 2010s. Since then, Furie has campaigned to reclaim the meaning of his character, and the emote has seen an upsurge in more mainstream non hate usage and positive usage on Twitch. Our results on Twitch agree, showing that “FeelsGoodMan” and its counterpart “FeelsBadMan” are mainly being used literally.’

Trouble Downstream

This kind of ‘bait and switch’ regarding the generalized ‘features’ of a meme can impede NLP research projects that have already categorized it as ‘hateful’, ‘right wing’ or ‘nationalist [US]’, and which have dumped that information into long-term open source repositories. Later NLP projects may not choose to audit the older data’s currency; may not have any practical mechanism to do so; and may not even be aware of the need.

The upshot of this is that using 2017 Twitch-based datasets to formulate a ‘political categorization ‘algorithm would attribute notable alt-right activity on Twitch, based on the frequency of the FeelsGoodMan emote. Twitch may or may not be full of alt-right influencers, but, according to the researchers of the new paper, you can’t prove it by the frog.

The ‘Pepe’ meme’s political significance appears to have been casually discarded by Twitch’s 140 million users (41% of whom are under 24), who have effectively re-stolen the work from the original thieves and painted it in their own colors, without any particular agenda.

Method and Data

The researchers found that labeled Twitch emote data was ‘virtually non-existent’, despite the conclusion of an earlier study that there are eight million total emotes, and 400,000 were present in the single week of Twitch output in the week chosen by those earlier researchers.

A 2017 study addressing emote prediction on Twitch limited itself to predicting only the top 30 Twitch emotes, scoring just 0.39 for emote prediction.

Addressing the shortfall, the San Francisco researchers took a new approach to the older data, splitting it 80/20 between training and testing, and applying ‘traditional’ machine learning methods, which had not been used before to study Twitch data. These methods included Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM, with linear kernels), and Logistic Regression.

This approach outperformed previous Twitch sentiment baselines by 63.8%, and enabled the researchers to subsequently develop the LOOVE (Learning Out Of Vocabulary Emotions) framework, which is able to identify neologisms and ‘enrich’ existing models with these new definitions.

Architecture of the LOOVE (Learning Out Of Vocabulary Emotions) framework developed by the researchers.

Architecture of the LOOVE (Learning Out Of Vocabulary Emotions) framework developed by the researchers.

LOOVE facilitates the unsupervised training of word embeddings, and also accommodates periodic retraining and fine-tuning, obviating the need for labeled datasets, which would be logistically impractical, considering the scale of the task and the rapid evolution of emotes.

In the service of the project, the researchers trained an emote ‘Pseudo-Dictionary’ on an unlabeled Twitch dataset, in the process generating 444,714 embeddings of words, emotes, emojis and emoticons.

Further, they augmented a VADER lexicon with an emoji/emoticon lexicon, and in addition to the aforementioned EC dataset, also exploited three other publicly available datasets for ternary sentiment classification, from Twitter, Rotten Tomatoes and a sampled YELP dataset.

Given the great variety of methodologies and datasets used in the study, the results are variegated, but the researchers assert that their best-case baseline outperformed the nearest prior metric by 7.36 percentage points.

The researchers consider that the ongoing value of the project is the development of LOOVE, based on word-to-vector (W2V) embeddings trained on over 313 million Twitch chat messages with the help of K-Nearest Neighbor (KNN).

The authors conclude:

‘A driving feature behind the framework is a emote pseudo-dictionary which can be used to derive sentiment for unknown emotes. Using this emote pseudo-dictionary, we created a sentiment table for 22,507 emotes. This is the first case of emote understanding on this scale.’

 

* My conversion of inline citations to hyperlinks.

 

Credit: Source link

Previous Post

Android Messages update handles Apple iMessage reactions properly

Next Post

Crypto startup MoonPay hits $3.4 billion valuation in new funding round

New York Tech Editorial Team

New York Tech Editorial Team

New York Tech Media is a leading news publication that aims to provide the latest tech news, fintech, AI & robotics, cybersecurity, startups & leaders, venture capital, and much more!

Next Post

Crypto startup MoonPay hits $3.4 billion valuation in new funding round

  • Trending
  • Comments
  • Latest
Meet the Top 10 K-Pop Artists Taking Over 2024

Meet the Top 10 K-Pop Artists Taking Over 2024

March 17, 2024
Panther for AWS allows security teams to monitor their AWS infrastructure in real-time

Many businesses lack a formal ransomware plan

March 29, 2022
Zach Mulcahey, 25 | Cover Story | Style Weekly

Zach Mulcahey, 25 | Cover Story | Style Weekly

March 29, 2022
10 Raunchy Movies on Netflix You Won’t Regret Watching

10 Raunchy Movies on Netflix You Won’t Regret Watching

May 20, 2024
How To Pitch The Investor: Ronen Menipaz, Founder of M51

How To Pitch The Investor: Ronen Menipaz, Founder of M51

March 29, 2022
Japanese Space Industry Startup “Synspective” Raises US $100 Million in Funding

Japanese Space Industry Startup “Synspective” Raises US $100 Million in Funding

March 29, 2022
Startups On Demand: renovai is the Netflix of Online Shopping

Startups On Demand: renovai is the Netflix of Online Shopping

2
Robot Company Offers $200K for Right to Use One Applicant’s Face and Voice ‘Forever’

Robot Company Offers $200K for Right to Use One Applicant’s Face and Voice ‘Forever’

1
Menashe Shani Accessibility High Tech on the low

Revolutionizing Accessibility: The Story of Purple Lens

1

Netgear announces a $1,500 Wi-Fi 6E mesh router

0
These apps let you customize Windows 11 to bring the taskbar back to life

These apps let you customize Windows 11 to bring the taskbar back to life

0
This bipedal robot uses propeller arms to slackline and skateboard

This bipedal robot uses propeller arms to slackline and skateboard

0
laptop on glass table

Automat-it Cuts Deployment Friction as Monce Scales AI Order Processing on AWS

April 13, 2026
Lee's Famous Recipe Chicken

Why Lee’s Famous Recipe Chicken Is Betting on Hi Auto to Quietly Rewire the Drive-Thru

April 9, 2026
computer generated image of letters

San Francisco Tribune Lists 11 HumanX Startups Moving AI Closer to the Operating Core

April 8, 2026
Impala CEO and Highrise AI CEO

The Industrialization of AI Infrastructure: What Impala and Highrise AI Reveal About the Next Scaling Frontier

April 7, 2026
Employee Time Tracking

What is an Employee Time Tracking Solution? A Definite Guide for 2026

March 31, 2026
Voltify founders

Voltify Raises $30 Million Seed Round as It Challenges $1 Trillion Rail Electrification Model

March 31, 2026

Recommended

laptop on glass table

Automat-it Cuts Deployment Friction as Monce Scales AI Order Processing on AWS

April 13, 2026
Lee's Famous Recipe Chicken

Why Lee’s Famous Recipe Chicken Is Betting on Hi Auto to Quietly Rewire the Drive-Thru

April 9, 2026
computer generated image of letters

San Francisco Tribune Lists 11 HumanX Startups Moving AI Closer to the Operating Core

April 8, 2026
Impala CEO and Highrise AI CEO

The Industrialization of AI Infrastructure: What Impala and Highrise AI Reveal About the Next Scaling Frontier

April 7, 2026

Categories

  • AI & Robotics
  • Benzinga
  • Cybersecurity
  • FinTech
  • New York Tech
  • News
  • Startups & Leaders
  • Venture Capital

Tags

AI AI QSRs Allseated Automat-it AWS B2B marketing Business CISO CISO Whisperer Collaborations Companies To Watch cryptocurrency Cybersecurity Entrepreneur Fetcherr Finance FINQ Fintech Funding Announcement hi-tech Hi Auto Impala Investing Investors investorsummit Israel israelitech Leaders LinkedIn Leaders Metaverse Mindset Minnesota omri hurwitz PointFive PR QSR Real Estate start- up startupnation Startups Startups On Demand Tech Tech leaders Unlimited Robotics VC
  • Contact Us
  • Privacy Policy
  • Terms and conditions

© 2024 All Rights Reserved - New York Tech Media

No Result
View All Result
  • News
  • FinTech
  • AI & Robotics
  • Cybersecurity
  • Startups & Leaders
  • Venture Capital

© 2024 All Rights Reserved - New York Tech Media