Abstract: eBay has one of the most sophisticated Data Platform’s in the industry with over 200PBs of data stored in our Hadoop and Teradata Warehouses. On average 30 TB of transactional and behavioral data is extracted on a daily basis and thousands of metrics are computed, analyzed and monitored for decision making and detecting anomalies. eBay has embarked on an ambitious project to transform the batch oriented ETL processes which could take 24 to 48 hour to near real time infrastructure. Apache Big Data Projects continue to play a critical role in this transformation process.
Seshu Adunuthula is Sr Director of Analytics Infrastruture at eBay responsible for managing some of the world’s largest deployments of Hadoop, Teradata and ETL Ingest platforms. He is an industry veteran with over 20 years of Distributed Computing and Analytics Experience. Prior to eBay he was managing the Engineering team at MapR responsible for MapReduce, MapR-DB and MapR Control System Products. Seshu also held various Engineering Manager and Architect roles at Oracle and Microsoft. Follow Seshu on Twitter @seshuad