Classification of Twitter Accounts into Automated Agents and Human Users
- 👤 Speaker: Zafar Gilani (Computer Lab)
- 📅 Date & Time: Thursday 13 July 2017, 15:00 - 16:00
- 📍 Venue: FW26, Computer Laboratory, William Gates Building
Abstract
Online social networks (OSNs) have seen a remark- able rise in the presence of surreptitious automated accounts. Massive human user-base and business-supportive operating model of social networks, such as Twitter, facilitates the creation of automated agents. In this paper we outline a systematic methodology and train a classifier to categorise Twitter accounts into ‘automated’ and ‘human’ users. To improve classification accuracy we employ a set of novel steps. First, we divide the dataset into four popularity bands to compensate for differences in types of accounts. Second, we create a large ground truth dataset using human annotations and extract relevant features from raw tweets. To judge accuracy of the procedure we calculate agreement among human annotators as well as with a bot detection tool. We then apply a Random Forests classifier that achieves an accuracy close to or surpassing human agreement. Finally, as a concluding step we perform tests to measure the efficacy of our results.
Series This talk is part of the Computer Laboratory Systems Research Group Seminar series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge talks
- Chris Davis' list
- CL's SRG seminar
- Computer Laboratory Systems Research Group Seminar
- Department of Computer Science and Technology talks and seminars
- FW26, Computer Laboratory, William Gates Building
- Interested Talks
- ndk22's list
- ob366-ai4er
- rp587
- School of Technology
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Zafar Gilani (Computer Lab)
Thursday 13 July 2017, 15:00-16:00