To understand what is required to support new innovative Internet applications, a solid understanding of Internet content characteristics (size, distribution, form, structure, evolution, dynamic) is necessary. The LAWA project will build an Internet-based experimental testbed for large-scale data analytics. Its emphasis is on developing a sustainable infrastructure, scalable methods, and easily usable software tools for aggregating, querying, and analyzing heterogeneous data at Internet scale.