Walid Magdy scite author profile

In this paper, we present our work on detecting abusive language on Arabic social media. We extract a list of obscene words and hashtags using common patterns used in offensive and rude communications. We also classify Twitter users according to whether they use any of these words or not in their tweets. We expand the list of obscene words using this classification, and we report results on a newly created dataset of classified Arabic tweets (obscene, offensive, and clean). We make this dataset freely available for research, in addition to the list of obscene words and hashtags. We are also publicly releasing a large corpus of classified user comments that were deleted from a popular Arabic news site due to violations the site's rules and guidelines.

show abstract

Content and Network Dynamics Behind Egyptian Political Polarization on Twitter

Borge-Holthoefer

et al. 2015

View full text Add to dashboard Cite

There is little doubt about whether social networks play a role in modern protests. This agreement has triggered an entire research avenue, in which social structure and content analysis have been central -but are typically exploited separately.Here, we combine these two approaches to shed light on the opinion evolution dynamics in Egypt during the summer of 2013 along two axes (Islamist/Secularist, pro/anti-military intervention). We intend to find traces of opinion changes in Egypt's population, paralleling those in the international community -which oscillated from sympathetic to condemnatory as civil clashes grew. We find little evidence of people "switching" sides, along with clear changes in volume in both pro-and anti-military camps.Our work contributes new insights into the dynamics of large protest movements, specially in the aftermath of the main events -rather unattended previously. It questions the standard narrative concerning a simplistic mapping between Secularist/pro-military and Islamist/anti-military.Finally, our conclusions provide empirical validation to sociological models regarding the behavior of individuals in conflictive contexts. *

show abstract

SemEval-2016 Task 3: Community Question Answering

Nakov¹,

Màrquez²,

Moschitti³

et al. 2016

145

View full text Add to dashboard Cite

show abstract

#ISISisNotIslam or #DeportAllMuslims?

Magdy

Darwish

Abokhodair

et al. 2016

View full text Add to dashboard Cite

QCRI: Answer Selection for Community Question Answering - Experiments for Arabic and English

Nicosia

Filice

Barrón-Cedeño

et al. 2015

View full text Add to dashboard Cite

This paper describes QCRI's participation in SemEval-2015 Task 3 "Answer Selection in Community Question Answering", which targeted real-life Web forums, and was offered in both Arabic and English. We apply a supervised machine learning approach considering a manifold of features including among others word n-grams, text similarity, sentiment analysis, the presence of specific words, and the context of a comment. Our approach was the best performing one in the Arabic subtask and the third best in the two English subtasks.

show abstract

#FailedRevolutions: Using Twitter to study the antecedents of ISIS support

Magdy¹,

Darwish²,

Weber³

2016

View full text Add to dashboard Cite

Within a fairly short amount of time, the Islamic State of Iraq and Syria (ISIS) has managed to put large swaths of land in Syria and Iraq under their control. To many observers, the sheer speed at which this "state" was established was dumbfounding. To better understand the roots of this organization and its supporters we present a study using data from Twitter. We start by collecting large amounts of Arabic tweets referring to ISIS and classify them into pro-ISIS and anti-ISIS. This classification turns out to be easily done simply using the name variants used to refer to the organization: the full name and the description as "state" is associated with support, whereas abbreviations usually indicate opposition. We then "go back in time" by analyzing the historic timelines of both users supporting and opposing and look at their pre-ISIS period to gain insights into the antecedents of support. To achieve this, we build a classifier using pre-ISIS data to "predict", in retrospect, who will support or oppose the group. The key story that emerges is one of frustration with failed Arab Spring revolutions. ISIS supporters largely differ from ISIS opposition in that they refer a lot more to Arab Spring uprisings that failed. We also find temporal patterns in the support and opposition which seems to be linked to major news, such as reported territorial gains, reports on gruesome acts of violence, and reports on airstrikes and foreign intervention. 1 http://www.ft.com/cms/s/2/ 97130d46-7952-11e4-9567-00144feabdc0.html 2 http://www.newsweek.com/german-journalist-returns-time-isis-chilli

show abstract

Your Stance is Exposed! Analysing Possible Factors for Stance Detection on Social Media

Aldayel

Magdy

2019

Proc. ACM Hum.-Comput. Interact.

View full text Add to dashboard Cite

To what extent user's stance towards a given topic could be inferred? Most of the studies on stance detection have focused on analysing user's posts on a given topic to predict the stance. However, the stance in social media can be inferred from a mixture of signals that might reflect user's beliefs including posts and online interactions. This paper examines various online features of users to detect their stance towards different topics. We compare multiple set of features, including on-topic content, network interactions, user's preferences, and online network connections. Our objective is to understand the online signals that can reveal the users' stance. Experimentation is applied on tweets dataset from the SemEval stance detection task, which covers five topics. Results show that stance of a user can be detected with multiple signals of user's online activity, including their posts on the topic, the network they interact with or follow, the websites they visit, and the content they like. The performance of the stance modelling using different network features are comparable with the state-of-the-art reported model that used textual content only. In addition, combining network and content features leads to the highest reported performance to date on the SemEval dataset with F-measure of 72.49%.We further present an extensive analysis to show how these different set of features can reveal stance. Our findings have distinct privacy implications, where they highlight that stance is strongly embedded in user's online social network that, in principle, individuals can be profiled from their interactions and connections even when they do not post about the topic.

show abstract

Improved Stance Prediction in a User Similarity Feature Space

Darwish¹,

Magdy

2017

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Walid Magdy

Abusive Language Detection on Arabic Social Media

Content and Network Dynamics Behind Egyptian Political Polarization on Twitter

SemEval-2016 Task 3: Community Question Answering

#ISISisNotIslam or #DeportAllMuslims?

QCRI: Answer Selection for Community Question Answering - Experiments for Arabic and English

#FailedRevolutions: Using Twitter to study the antecedents of ISIS support

Your Stance is Exposed! Analysing Possible Factors for Stance Detection on Social Media

Improved Stance Prediction in a User Similarity Feature Space

Contact Info

Product

Resources

About