Efficiently querying archived data using Hadoop
Rajeev Gupta, Himanshu Gupta, et al.
CIKM 2010
Data delivered over the Internet is increasingly being used to provide dynamic and personalized user experiences. Queries over fast-changing data from distributed data sources are executed to create content to be delivered to users. Because these queries require data from multiple sources, they're executed at intermediate proxies or data aggregators. The authors discuss various techniques for executing aggregation queries over distributed data to minimize the number of message exchanges between data sources, aggregators, and users. They carefully examine the problem in terms of different types of queries, aggregation functions, query imprecisions, and whether the aggregators get data from sources using pull- or push-based mechanisms. © 2006 IEEE.
Rajeev Gupta, K. Hima Prasad, et al.
ICAC 2008
Anoop George Ninan, Purushottam Kulkarni, et al.
IEEE Transactions on Knowledge and Data Engineering
Rajeev Gupta, Krithi Ramamritham
ICDE 2014