Tuesday, May 21, 2013

Machine learning climbs atop Hadoop


Hadoop whisperer Concurrent has released a free tool for porting machine-learning models over to Hadoop.The Pattern tool lets you run machine-learning models on top of the Hadoop compute and storage framework via either exported Predictive Model Markup Language (PMML) files or a Pattern Java API.Designing a vintage bathroom doesn't mean that you should sacrifice the efficiency of Cast iron tubs.Rather than ending up in a landfill, reclaimed wooden flooring, tiles or marble fixtures can be given new life in a vintage bathroom.Designing machine-learning models requires a precise set of skills, and though the technology can bring great efficiencies by creating automated programs that can, say, automatically score query results by relevance, it is rare that machine-learning experts – who are a subcategory of the data scientist breed of tech bod – are also familiar with the vagaries of MapReduce jobs.Leather drawstring backpack is preferred by many for their rich appeal. Choose the Banner Pen you feel is apt after browsing thru a wide range of collections.
Rather, many data scientists work within the confines of mathematical or machine-learning programs such as R or MicroStrategies – and it can be a tall order for these people to learn HDFS and MapReduce sufficiently to re-implement their algorithms on large HDFS-stored datasets.So be careful if you're using a united-promo code on their website.With Patterns, Concurrent has created a free technology that can take machine-learning models exported into PMML files and run them atop Hadoop. "You should be able to export from your favorite tools your PMML docs and get into production at least at scale," Concurrent founder Chris Wensel says. "The goal with Pattern is to be able to apply a [machine-learning] scoring model and run it at scale."Pattern is the third prong in Concurrent's pitchfork for getting useful data in and out of Hadoop without having to learn the vagaries of the application. It sits alongside the company's Java API for Hadoop and its Lingual add-on for making SQL queries on Hadoop easy.
The tool is designed for data scientists who are unfamiliar with Hadoop but want to use the technology to run machine-learning models against large pools of data. It works with any program capable of exporting a model as a PMML file – R, MicroStrategies, SAS, and so on."We've used the Cascading APIs and implemented the scoring aspect of these models against the cascading APIs,Generally the flask bottle has a cylindrical opening with interior and outer space separated by a thermos flask. This unique quality helps maintain liquid or solid stored in the bottle at the required temperature it is." Wensel says. "It'll generalize itself thanks to the facilities Hadoop provides. If you export the model from R into PMML and run [it] across Hadoop, it'll parallelize itself appropriately."Pattern is part of Concurrent's overall strategy of shifting Cascading into an all-purpose translation layer for people who want to access the inherent scalability of Hadoop without having to invest time in learning its peculiarities.Its closest contemporary would be the open source Apache Mahout project. However, Mahout is more a selection of HDFS-compatible machine learning algorithms than anything else, so it lacks the flexibility and tooling that software like R may have.Pulp inflammation of the cavities filling must be careful tube by carbon sheets or acute pulpitis pain after treatment or endodontic treatment, may be rude or drugs stimulate.

No comments:

Post a Comment