Apache Hadoop® is an open-source platform for highly reliable, scalable, distributed processing of large data sets using simple programming models. Hadoop runs on clusters of commodity computers, providing a cost-effective way to store and process massive amounts of structured, semi-structured, and unstructured data without imposing format requirements. This makes Hadoop well suited to building data lakes that support big data analytics initiatives.
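
To make "simple programming models" concrete, here is a minimal sketch of the MapReduce pattern, the model most commonly associated with Hadoop. It assumes that MapReduce is the model meant here, and it runs in a single Python process; it does not use any Hadoop API. The function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample input are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def map_phase(record):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in record.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group values by key, standing in for the grouping a cluster
    framework performs between the map and reduce steps."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: collapse all values for one key into a single total."""
    return key, sum(values)


def word_count(records):
    """Run map, shuffle, and reduce in sequence over an iterable of lines."""
    pairs = (pair for record in records for pair in map_phase(record))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


# Hypothetical sample input: two short lines of text.
counts = word_count(["big data on Hadoop", "Hadoop stores big data"])
```

On a real cluster the map and reduce functions would run in parallel on many machines, with the framework handling data distribution and the grouping step; the appeal of the model is that the developer writes only the two small functions.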