TY - GEN
T1 - EmojiNet
T2 - 11th International Conference on Web and Social Media, ICWSM 2017
AU - Wijeratne, Sanjaya
AU - Balasuriya, Lakshika
AU - Sheth, Amit
AU - Doran, Derek
N1 - Publisher Copyright:
© Copyright 2017, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2017
Y1 - 2017
N2 - This paper presents the release of EmojiNet, the largest machine-readable emoji sense inventory that links Unicode emoji representations to their English meanings extracted from the Web. EmojiNet is a dataset consisting of: (i) 12,904 sense labels over 2,389 emoji, which were extracted from the web and linked to machine-readable sense definitions seen in BabelNet; (ii) context words associated with each emoji sense, which are inferred through word embedding models trained over Google News corpus and a Twitter message corpus for each emoji sense definition; and (iii) recognizing discrepancies in the presentation of emoji on different platforms, specification of the most likely platformbased emoji sense for a selected set of emoji. The dataset is hosted as an open service with a REST API and is available at http://emojinet.knoesis.org/. The development of this dataset, evaluation of its quality, and its applications including emoji sense disambiguation and emoji sense similarity are discussed.
AB - This paper presents the release of EmojiNet, the largest machine-readable emoji sense inventory that links Unicode emoji representations to their English meanings extracted from the Web. EmojiNet is a dataset consisting of: (i) 12,904 sense labels over 2,389 emoji, which were extracted from the web and linked to machine-readable sense definitions seen in BabelNet; (ii) context words associated with each emoji sense, which are inferred through word embedding models trained over Google News corpus and a Twitter message corpus for each emoji sense definition; and (iii) recognizing discrepancies in the presentation of emoji on different platforms, specification of the most likely platformbased emoji sense for a selected set of emoji. The dataset is hosted as an open service with a REST API and is available at http://emojinet.knoesis.org/. The development of this dataset, evaluation of its quality, and its applications including emoji sense disambiguation and emoji sense similarity are discussed.
KW - Social networking (online)
KW - Context-word
KW - ITS applications
KW - Most likely
KW - News corpora
KW - Open services
KW - Sense inventories
KW - Unicodes
UR - http://www.scopus.com/inward/record.url?scp=85029446461&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85029446461&partnerID=8YFLogxK
UR - https://corescholar.libraries.wright.edu/knoesis/1118
U2 - 10.1609/icwsm.v11i1.14857
DO - 10.1609/icwsm.v11i1.14857
M3 - Conference contribution
AN - SCOPUS:85029446461
T3 - Proceedings of the 11th International Conference on Web and Social Media, ICWSM 2017
SP - 437
EP - 446
BT - Proceedings of the 11th International Conference on Web and Social Media, ICWSM 2017
PB - AAAI Press
Y2 - 15 May 2017 through 18 May 2017
ER -