Categories
Tech

MEP 3.0 Of MapR Ecosystem Pack Provide Security For Apache Spark

Every Apache Spark development company has a reason to rejoice as MAPR Technologies has released the version 3.0 of MapR Ecosystem Pack (MEP) that is targeted towards providing improved security for Apache Spark, new Apache Spark connectors for MapR-DB and HBase, integrations with Drill, and faster version of Hive. The following provides an overview of […]

Categories
Tech

An Overview Of Data Clustering Techniques Provided by Apache Spark

Apache Spark is hailed for its exceptional data processing and analyzing capacities that are a result of its well-developed machine learning library (MLib). Data clustering is typically an offline process that groups several entities from the dataset based on set criteria for a particular cluster. Rather than example based learning that happens in data classification, […]