Documentos de Académico
Documentos de Profesional
Documentos de Cultura
Database Systems
Lecture 19
Big Data and
Big Data Analytics (2)
Notes
14 October 2015
Date
Day
10
13 Oct
Tues
14 Oct
Wed
16 Oct
Fri
20 Oct
Tues
21 Oct
Wed
23 Oct
Fri
27 Oct
Tue
28 Oct
Wed
11
12
Topic
L18:
L20:
L21:
No prac
project day for Computer Science
L22: Class test 3: data analytics
Presentation for topic 18
L23: Data analytics: Data mining
Presentation for topics 19, 20
Outline
Last lecture:
1. Technologies supporting Big data storage & analytics
MapReduce computation framework
NoSQL big database management systems (BDMSes)
NewSQL big database management systems (BDMSes)
This lecture:
Case study:
Analysis of microblogs data: Twitter
sentiment analysis of microblogs
3
Twitter statistics
More recently:
Twitter adoption in SA
Adoption in South Africa:
Businesses
governments
non-government
organisations have a Twitter & Facebook presence.
Twitter analytics
Two approaches to analysis:
(1) Online analytics:
(i) Subscribe to a service for social media data analytics
(ii) use service to obtain analysis reports & Twitter data
Sentiment140: http://www.sentiment140/
available languages
10
1. number of:
tweets per day, mentions,
retweets, favourited tweets
Can download
tweets in
MS Excel format
for further
ORSSA 2015 presentation 15
analysis
September 2015
13
14
Twitter APIs:
(1) REST APIs
OAuth:
positive,
negative, or
neutral.
the effect of one tweet may be small but the effect of many is
significant
Analysis methods:
16
tweets
to be
classified
Predictive
( classification)
model
-ve
sentiment
tweets
neutral
sentiment
tweets
17
Essay presentation
Topic topic 16
18
References
1. IBM Global Business Services (2012) Analytics: the real-world use of big data
how innovative enterprises extract value from uncertain data, IBM Institute
for Business value.
2. Moniruzzaman, A.B.M. & Hossain, S.A. (2013) NoSQL database: new era of
databases for big data analytics classification, characteristics and
comparison. International Journal of Database Theory and Application, vol. 6,
no. 4, 2013.
3. Wakade, S., Shekar, C., Liszka, K. J. and Chan, C.-C., 2012, Text Mining for
Sentiment Analysis of Twitter Data, International Conference on Information
and Knowledge Engineering, (IKE'12), pp. 109-114.
19