Proceedings:
Proceedings of the International AAAI Conference on Web and Social Media, 6
Volume
Issue:
Vol. 6 No. 3 (2012): ICWSM Workshop Technical Report WS-12-02 (Real-Time Analysis and Mining of Social Streams)
Track:
Real-Time Analysis and Mining of Social Streams
Downloads:
Abstract:
We present a novel approach to collecting and distributing social media data in web service projects using both clients and servers for real-time analysis, ultimately providing an inexpensive and scalable method of a quality that has not been available to date. Current challenges to social data mining include vendor enforced API limits and infrastructure costs. Our hybrid client / server approach allows data to be collected via JavaScript in browsers as well as by servers. This allows applications to compute a wide range of data analytics. We present pure client and server based collection strategies, then demonstrate how our method has substantial advantages over both. Specific advantages include lower infrastructure requirements and greater efficiency in API utilization. Our approach distributes the majority of data collection tasks to client web browsers while using servers to supply more complex analysis techniques. In addition, we provide details on two open source tools we have released to facilitate implementation by researchers in their own projects. We close by detailing a use case scenario describing a large scale public web service project followed by a solution accomplished using our approach and open source tools.
DOI:
10.1609/icwsm.v6i3.14353
ICWSM
Vol. 6 No. 3 (2012): ICWSM Workshop Technical Report WS-12-02 (Real-Time Analysis and Mining of Social Streams)