Hadoop Featured Articles
AMD's SeaMicro SM15000 Server Now Certified for Cloudera's Distribution Including Apache Hadoop Version 4
Advanced Micro Devices (AMD), a semiconductor design innovator, recently announced that its SeaMicro SM15000 server is now certified for CDH4, Cloudera's Distribution Including Apache Hadoop Version 4.
MapR Technologies Secures $30 Million in Funding for Global Expansion, Investment in R&D
Today, the MapR Big Data Platform is being used in production deployments across financial services, government, healthcare, manufacturing, retail and Web 2.0 companies to drive significant business results and includes the analysis of hundreds of billions of objects a day, 90 percent of the Internet population monthly and more than a trillion dollars of retail transactions annually.
Twitter and Storm Enable Real-Time Hadoop with MapReduce
MapR and Twitter recently demonstrated real-time Hadoop analytics at the Strata Conference at the end of February. By harnessing the power of the Twitter API, the two companies streamed the #strataconf hashtag directly into a cluster.
MapR Direct Access NFS Makes Running MapReduce Easier
For a lot of people who deal with Hadoop, one of the most complex and frustrating jobs can be running MapReduce on ingested data. The task is usually done in batches, with data needing to be transferred to local files that the Hadoop cluster can get at, then the whole thing needs to be copied to HDFS, either with hadoop.fs or with Flume. After all that transferring, then, finally, can MapReduce be run at all. But now, thanks to MapR's new Direct Access NFS feature, the whole process is about to get a lot simpler, requiring a lot less transferring.
MapR Technologies to Present Big Data and Hadoop Insight at Future Conferences
MapR Technologies, a provider of Hadoop distribution, will be presenting its insights related to big data and Hadoop at three forthcoming conferences. The organization will also exhibit its prize-winning MapR Distribution at these conferences. MapR Technologies will be attending Gartner Business Intelligence & Analytics Summit, GigaOm Structure:Data and SVForum: Big Data and Analytics Conference Between Now and 2020, all scheduled in March.
En Pointe Becomes First VAR to Sell Intel Distribution for Apache Hadoop within Public, Private Cloud
En Pointe Technologies recently announced that through its partnership with Intel, it has become the first value-added reseller (VAR) to offer hardware, software and services support to enterprise customers, enabling them to optimize their Apache Hadoop deployment. En Pointe has been continuously investing in cloud, collaboration and new technologies which have helped it consolidate its foundation within the big data space.
SimpleSearch Makes Hadoop Data More Accessible
Let's not kid ourselves; Hadoop is doing well. Some would say very well; a recent eWeek article declared that 2013 would be the year that Hadoop beat out the big data analytics competition.
HP Converged Infrastructure Improves IT Agility, Reduces Cost
In just three years, the HP Converged Infrastructure portfolio has outpaced the competition with substantial benefits delivered to business and government customers around the world.
Sorting 1.5 Terabytes of Data in 59 Seconds: MapR Technologies Breaks MinuteSort Record
One terabyte is 1,000 gigabytes, or the equivalent of 6.5 Netbooks, the storage space on one standard Dell Inspiron desktop computer, 250,000 images that are 4 megs in size each, 128 8-Gig movie DVDs or 20 Blu-ray discs. MapR Technologies sorted more than that in less than one minute.
Hadoop: The Next Big Thing in 2013
The last three years have been significant for the Java-based programming framework, Hadoop, and not because it has an unusual name. When it comes to big data, IT professionals agree: Hadoop is set up to dominate with more capabilities to come in 2013.
Supermicro Unveils Big Data Solution Supporting Intel Distribution for Apache Hadoop
Supermicro, a provider of application-optimized server, workstation, blade, storage and GPU systems, has launched big data solutions with support for the new Intel Distribution for Apache Hadoop Software. To meet these requirements, Supermicro's Hadoop-optimized server and storage systems have undergone rigorous testing and validation.
Hadoop is Emerging as the Dominant Big Data Analytics Platform
When we tweak photos, we "Photoshop" them. Soon, when we crunch big data we might consider impressing our friends by "Hadooping" the data. At least that's the direction advanced analytics products are going, with Gartner predicting that two-thirds of such products will have Apache Hadoop embedded within them by 2015.
MapR Plans Sessions to Highlight Impact of Hadoop on Competitive Advantage
MapR Technologies, a specialist in Hadoop, plans to highlight the impact of Hadoop on a company's competitive advantage in a series of sessions. Hadoop was created to analyze different types of big data covering data files that are structured or unstructured, logs, pictures, audio files, communications records and e-mail, which are transferred over networks every day.
MapR Technologies and HP Unveil Big Data Reference Architecture
Today, we are dealing with all different kinds of data files: structured, unstructured, logs, pictures, audio files, communications records, e-mail and more are all dealt with and transferred over networks every day. Hadoop was created to analyze different types of big data, however, it is complex to deploy, configure, manage and monitor. Before deploying a Hadoop cluster, things to consider including operating systems, computation, memory, storage, network, switches and data movement in and out of the cluster.
HStreaming Announces Funding Round Led by Atlas Ventures
HStreaming, a provider of solutions for realizing continuous real-time analytics to big data, recently raised an undisclosed amount of venture funding by Atlas Venture, an early stage venture capital firm.
WANdisco Unveils Production-Ready Apache Hadoop 2 Distribution for Big Data
Big data specialist WANdisco has announced the availability of WANdisco Distro (WDD), the world's first production-ready Apache Hadoop 2 distribution for big data, for free download.
Is Apache Drill Proving to be a Valuable Tool in the Hadoop Toolbox?
The Apache Drill project launched back in August 2012, and with six months between the launch and the present day, some are already taking a look back at the project to see if it's proving its worth in the field. With advances being made on several fronts, the idea of use-cases for the Apache Drill are beginning to come into light, and the total value of the system is starting to make itself known.
Big Data Coming Into its Own as Stable, Mature Technology
There's no denying that technology, especially these days, is moving fast. New developments emerge on a regular basis and developments that were formerly new find their metaphorical chrome peeling away to reveal either flashes in the pan or the solid underpinnings of stable technologies. When the Gartner Hype Cycle--a process by which many technologies can be tracked from new and shiny to mature and stable and even to obsolete if one goes out far enough--was applied to big data, the unexpected came back: big data is further along than many predicted.
Supermicro Offers Hadoop-Optimized Hardware and Solutions
Enterprises today collect and generate more data than ever before. Hadoop is an open-source framework for running applications on large clusters of commodity hardware, designed to solve the scalable, reliable storage and analysis of both structure and complex data. Ted Dunning, architect at MapR Technologies, recently presented "The Power of Hadoop to Transform Business," talking about the future of Hadoop. Integration of Hadoop with traditional IT makes a big difference in the way companies can use scalable computing.
MapR Distribution Used to Analyze 90 Percent of Internet Population Each Day
Hadoop technologies specialist MapR recently released a list of its "customer wins" for 2012, which provides further insight into the spike in demand the company experienced for its analytics software over the last year, particularly from large consumer-facing organizations.
Dataguise Partners with MapR for Apache Hadoop
Dataguise has recently formed a partnership with MapR Technologies to manage complex data analysis workloads and prevent unauthorized access to data on Hadoop platform. Under this partnership, DG for Hadoop is now certified for use with the MapR Distribution for Apache Hadoop. This certification also assures data privacy protection and delivers risk assessment intelligence for enterprises using the MapR Hadoop distribution. This is win- win deal for both companies.
Dataguise Partners with MapR for Apache Hadoop
Dataguise has partnered with MapR Technologies to create a way to manage complex data analysis workloads and prevent unauthorized access to data on the Hadoop platform. Under this partnership, DG for Hadoop is now certified for use with the MapR Distribution for Apache Hadoop. This certification also assures data privacy protection and delivers risk assessment intelligence for enterprises using the MapR Hadoop distribution, a win- win deal for both companies.
Hadoop Specialist MapR Wins Big Customers across Broad Section of Industries
MapR Technologies, a specialist in Hadoop technologies, has achieved significant customer wins across a broad section of industries.
WANdisco, Cloudera Accelerate Adoption of Hadoop-Based Business Solutions
The Cloudera Connect Partner Program from Cloudera, a company specializing in Hadoop and big data, focuses on accelerating the innovative use of Apache Hadoop for a range of business applications. The program is gaining wider support from companies focusing on big data applications.
Upgrading Hadoop with Real-Time Interactive Queries
The growing volume of data continues to drive the need for capable and cost-effective analytical tools. As a result, driven by technology advances, Hadoop gained significant traction in the marketplace last year. Suitable for a more broad range of organizations and use cases, Hadoop is predicted to establish its dominance in big data analytics with the addition of even more capabilities.
R Programming Language Can Manipulate Hadoop Now with RHadoop
Statisticians who are familiar with the R programming language now are better able to use Hadoop to run MapReduce jobs or access HBase tables. Revolution Analytics has created RHadoop, a collection of three R packages that let users run MapReduce jobs entirely from within R as well as giving them access to their Hadoop files and HBase tables, according to a recent MapR Technologies blog post.
New Leadership Team Helps MapR Meet Growing EMEA Demand for Hadoop
MapR Technologies, the open, enterprise-grade distribution for Hadoop, recently launched its European operation to meet the needs of its growing community of customers and partners across the region. The new European headquarters in London, England, will provide MapR with a base for technical and sales resources to accelerate the adoption of its high performance, enterprise-grade distribution for Hadoop.
Create Virtual Hadoop Cluster Environments in Less Than 10 Minutes with Skytap Cloudera Hadoop for Enterprise Hybrid Clouds
"Big data" is a collection of information that is so large you can't use normal means to process it. With six billion mobile subscriptions worldwide, more than one billion Facebook users and 400 million tweets per day, the volume of digital content is expected to reach the equivalent of 18 Libraries of Congress by 2015. So, a solution called Hadoop was born. Hadoop is an open-source way of storing and processing data. It can handle all types of data from disparate systems, such as structured, unstructured, log files, pictures, audio files, communications records and e-mail.
CRGT Acquisition Expands Big Data Analytics Capabilities
Guident Technologies has come under the umbrella of CRGT, a provider of full life-cycle IT services and emerging technology solutions for the Federal Government and a Veritas Capital portfolio company. This acquisition proves CRGT's expansion into key technology growth markets, bringing high-end capabilities in the arenas of big data analytics and business intelligence solutions to CRGT's existing portfolio of IT service offerings.
Hadoop Changing the ETL Process
Is Hadoop fundamentally changing the data warehousing equation?
Open Source: Driving Big Data into 2013
Big data has quickly turned into the biggest thing to hit information technology since the virtualization craze of the last decade. According to research firm Wikibon, the big data market is on the verge of a rapid growth spurt that will hit $50 billion worldwide within the next five years. The rate at which the importance and popularity of big data has grown can be directly attributed to open source. Most of the new big data frameworks and databases have their roots in the open source world, where developers routinely create new approaches to problems that haven't yet hit mainstream.
2013: A Big Year for Big Data?
With today being the last day of the year and all, it's not hard to look forward at the upcoming year and wonder just what will happen. In turn, it's no surprise that the various big data concerns out there also took a look and gave some of their predictions about what was likely to happen in this space.
The H(app)athon Project Redefines Big Data Analytics
Dealing with big data is critical for large enterprises that depend on their networks for business continuity. Big data analytics applications like Hadoop enable organizations to investigate, troubleshoot and diagnose network, security and application related problems. A new project called the H(app)athon Project is helping organizations deal with the challenges associated with handling big data.
Hadoop Demonstrates Open Source Power with Overstock App
When running a business, it's key to ensure you have the latest innovations that help you get your product to market, meet the needs of your customer base and still turn a profit. Such innovations vary according to the company and the industry, but the needs exist all the same. But, what if you could get exactly what you need and it didn't cost you any money - what would that mean for your bottom line?
Reigning in Big Data: Enterprises Move to the Cloud to Manage Storage Needs, Improve Accessibility
Documents, e-mails, contracts, media and graphics - these are just a sample of the electronic content that businesses around the world generate each day. And while they're not filling file cabinets and taking up valuable square feet of real estate, they are consuming valuable storage resources.
Oracle Unveils New Big Data Appliance X3-2
Oracle has unveiled its new Big Data Appliance X3-2 - which may prove attractive to many businesses and other organizations looking to upgrade their technology in the big data age. The Oracle Big Data Appliance X3-2 includes hardware and software which features Intel's new processors, Apache Hadoop (CDH), Cloudera Manager and the new Oracle Enterprise Manager plug-in for Big Data Appliance.
Exar Gives Insight on Maximizing Hadoop Performance with Hardware Compression
Hadoop is an open-source Apache implementation project. The project aims to deal with the big data challenges by facilitating the storage and processing of big data.
Infochimps Enterprise Cloud Delivers Huge Benefits to Enterprises
By deploying Infochimps Enterprise Cloud, one can easily bring down the risk, time-to-value and complexity involved in enterprise big data projects.
Hadoop May Not Be Perfect, but MapR's Distribution Aims to Be
A shift has occurred in the way companies collect data, compared to the way it has traditionally been done. While before, the only data that was needed to answer a specific question was gathered, companies now cast a wide net in an effort to gain a better understanding of their customers, while striving to gather more customers. These massive stores of data are called big data and have led to an industry dedicated to making sense of this data.
ExtraHop Updates Application Performance Management for Big Data
Today the term big data has come into use recently to refer to the increasing amount of information that organizations are storing, processing and analyzing. To effectively use the vast amounts of data, ExtraHop Networks, a provider of network-based application performance management (APM) solutions, has launched its SAP Sybase IQ Module, designed to give IT organizations operational intelligence into big data analytics and data warehousing environments.
MapR Expands to Europe
With the proliferation of mobile and Internet-connected devices, the amount and size of data in industries today is growing, fast. Hadoop is an open-source framework designed to process and store big data. It makes data mining, analytics and processing of big data cheap and fast.
Hortonworks and Luminar Partner to Enhance Customer Big Data Management
Luminar, a data analytics and modeling provider focused specifically on connecting marketers with U.S. Latino consumers, is using the Hortonworks Data Platform (HDP) to deploy fully integrated big data architecture. The announcement about this agreement has been made by Hortonworks, a contributor to Apache Hadoop.
The Advantages of MapR Technologies' Hadoop Distribution
Hadoop was recently described as a "three-headed open core" run by Cloudera, Hortonworks and MapR Technologies. Which begs the question: Which head should you choose when planning to leverage Hadoop?
LucidWorks, MapR Technologies to Discuss Unlocking the Secrets of Big Data with Search and Hadoop
According to research firm IDC, the total amount of digital data, or big data, will reach 2.7 zettabytes by the end of this year. Approximately 90 percent of this data will be unstructured. By transforming this unstructured data into business insights, the businesses can gain important competitive advantages. LucidWorks, a developer of search, discovery and analytics software based on Apache Lucene and Apache Solr technology, and MapR Technologies, a developer of Apache Hadoop-derived software, have teamed up to jointly host a webinar that will highlight the ways big data can be tapped and leveraged for creating business value.
Tervela Turbo, Now Certified on CDH4, Helps Implement Mission-Critical Hadoop Systems
Big data is playing a growing and critical role in day-to-day business operations, helping companies compete more effectively and become more efficient. However, difficulties in capturing this data and delivering it to front-line business systems have slowed down what companies can do with the data at their fingertips. Apache Hadoop was born out of necessity as data from the Web exploded, and grew far beyond the ability of traditional systems to handle it.
MapR Technologies' Jack Norris to Discuss Hadoop in the Cloud
Around 80 percent of data from big data is unstructured. With this massive quantity of unstructured data, businesses need faster, more reliable and deeper data insights. Therefore, big data solutions based on Hadoop and other analytics software are becoming more and more relevant. One of Hadoop's strengths is that it can process and analyze huge amounts of unstructured data - video, audio, social media postings, images, etc. - in ways that were previously impossible.
Mortar Data Makes Hadoop Collaboration Easy
Mortar Data is a startup that only just publicly launched in late November. It is also an open-source development framework for Hadoop that is built specifically for collaboration, allowing for easy sharing, repeating and maintaining of code. Mortar was initially only offered as a hosted Hadoop service, but the company has now released the open-source Mortar framework for Hadoop applications.
Archimedes Selects Univa Grid Engine for Healthcare Hadoop
Today, the Web is exploding with a tsunami of data inundating organizations like never before, and traditional systems are shying away from storing, let alone analyzing, the thousands of petabytes of data. The birth of Apache Hadoop has provided them with a fundamentally new way for efficient handling of exponential data.
Big Data Partnership Joins Microsoft Big Data Partner Incubation Program
London-based big data consultancy firm, Big Data Partnership, is one of the few organizations that Microsoft has handpicked to participate in its Big Data Partner Incubation Program. As part of this program, Big Data Partnership will collaborate with Microsoft to offer Microsoft HDInsight, an Apache Hadoop-based solution for the Windows Azure and Windows Server platforms that make managing big data easier for the enterprises.
From Big Data Comes Big Opportunity: Five Ways to Make Hadoop & Big Data Work for You
Big data me