Alan Gates describes how Apache Pig*, a high-level data flow programming language and execution framework, makes it easy to create Apache MapReduce* applications. This overview by an expert from the Apache Hadoop* open-source community covers how the Pig platform fits into the Apache Hadoop framework; the value of Pig Latin, an easy-to-learn programming language that focuses on data flow; the difference between the Pig platform and the Apache Hive* data warehouse infrastructure; the limitations of Pig; and where development of the platform is headed. This interview is part of the Intel® IT Center’s Apache Hadoop Community Spotlight series. A podcast of the interview is also available.
Apache Pig* overview.
Apache HDFS* overview.
Apache MapReduce overview.
Linda Feldt highlights big data research (video)
The Intel® Distribution for Apache Hadoop* Software