Microsoft: SCOPE - Easy and Efficient Parallel Processing of Massive Data Sets

A new whitepaper that Microsoft researchers are set to present at a conference next month sheds more light on Microsoft's back-end cloud infrastructure. The paper, entitled, “SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets,” details a new declarative scripting language that is optimized for storing and analyzing massive data sets (like search logs […]

A new whitepaper that Microsoft researchers are set to present at a conference next month sheds more light on Microsoft's back-end cloud infrastructure.

The paper, entitled, “SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets,” details a new declarative scripting language that is optimized for storing and analyzing massive data sets (like search logs and click streams) that are key to cloud-scale service architectures. SCOPE, or Structure Computations Optimized for Parallel Execution, is the name of the language.

Scope is similar to Yahoo’s Pig, which is a higher level language on top of Hadoop, or Google’s Sawzall, which is a higher level language on top of MapReduce. But, where Pig focuses on and advocates a more imperative programming style, Scope looks much more like SQL.”

Full Article