MapR Enhances its Real-Time Processing Capabilities for Big Data Analysis

The big data platform MapR just introduced version 5.0 of its Hadoop distribution, based on version 2.7 of the open source framework designed for processing very large volumes of data, with support for Docker containers. MapR 5.0 also relies on the YARN resource manager.

This version strengthens the platform's real-time operational capabilities. In particular, it extends the highly reliable data transport framework used by the MapR-DB table replication feature (which allows replication between multiple data centers) to deliver data to external compute engines and keep them synchronized in real time.

Compared to other Hadoop distributions, MapR extends the framework's functionality in security (data protection, user authentication, disaster recovery) as well as in high availability and performance. Version 5.0 brings further improvements in governance, with full auditing of data access through JSON and support for Apache Drill views for secure access to the data being analyzed.

More and more companies deploy multiple applications on the same Hadoop cluster. In this context, the latest MapR release automates the synchronization of storage, databases and search indexes.

To facilitate the deployment of Hadoop clusters, the vendor has also included new auto-provisioning templates that set up a cluster as if it were an appliance, without requiring specific hardware. These templates can be deployed using the MapR installer. Possible configurations include data lake services, data exploration (interactive SQL with Apache Drill) and operational data analysis (basic services and the MapR-DB NoSQL database).

The Apache project helps with analysis and with running batch processes and their pipelines, performing fast, large-scale computations. The announced distribution automatically synchronizes storage, databases and search indexes to enable complex real-time applications. It also has new auditing capabilities.

MapR Technologies intends to continue its growth in the big data and analytics segment. To that end, the MapR database can now use table replication to synchronize data in real time and make it available to external compute engines. The first supported use case is Elasticsearch, the search platform based on Lucene, which enables automatically synchronized full-text search indexes.

Last year, MapR and Apache Spark integrated their technologies to offer users around-the-clock support for Spark, to develop the solution and related projects faster, and to integrate more innovative changes. In addition, the two parties are working together on rapid development of the software and on complementary new features. This should pay off for MapR customers and the Hadoop community over the coming years.

Recently, Oracle released a new software product designed to help meet big data demands. The product, called Oracle Big Data Spatial and Graph, provides new analytical capabilities for Hadoop and NoSQL. Oracle built it to process data natively on Hadoop and in parallel on MapReduce, using in-memory structures.


CloudTimes

Major IT Players Form R Consortium to Strengthen Data Analysis

The Linux Foundation announced the formation of the R Consortium, with the intention of strengthening the technical and user communities around R, the open source programming language for statistical data analysis.

The new organization has become an official Linux Foundation project and is designed to support users of the R language. The R Consortium is expected to complement the existing R Foundation, focusing on expanding R's user base and on improving interaction between users and developers.

Representatives of the R Foundation and of industry are behind the new consortium. Microsoft and RStudio have joined the consortium as platinum members. TIBCO Software is a gold member, and Alteryx, Google, HP, Mango Solutions, Ketchum Trading and Oracle have joined as silver members.

The R Consortium will complement the work of the R Foundation by establishing communication with user groups and supporting projects related to the creation and maintenance of R mirror sites, testing, quality control resources, financial support and promotion of the language. The consortium will also assist in creating support packages for R and in organizing other related software projects.

R is a programming language and development environment for scientific calculations and graphics that originated at the University of Auckland (New Zealand). The R language has enjoyed significant growth and now has more than two million users. A wide range of industries have adopted R, including biotech, finance, research and high tech. The R language is frequently integrated with analysis, visualization, and reporting applications.

Having acquired the company Revolution Analytics (which makes heavy use of the language), Microsoft announced that it is joining the consortium together with other founding members such as Google, Oracle, HP, TIBCO, RStudio and Alteryx to fund the new consortium.

A Microsoft official said that “the R Consortium will complement the work of the R Foundation, a nonprofit organization that maintains the language, and will focus on user outreach and other projects designed to assist the R user and developer communities. This includes both technical and infrastructure projects such as building and maintaining mirrors for downloading R, testing, QA resources, financial support for the annual useR! Conference and promotion and support of worldwide user groups.”

Google also says that thousands of its users and developers use R, making the language crucial for many of its products. Google is happy to join the other companies in continuing to maintain the infrastructure of open source R.

Microsoft’s support for real-time analytics for Apache Hadoop in Azure HDInsight and its machine learning offerings in Azure Marketplace use the R language to provide anomaly detection for preventive maintenance or fraud detection.

