Integrations Journal



Big Data Profiling For Hadoop Simplified By Talend

Written by Raja Rao | 14 November 2012

Companies face exponential growth in the volume and heterogeneity of the data and applications they need to manage and control. A key challenge for IT departments today is ensuring the consistency of their data and processes through modeling tools, workflow management and storage, which together form the foundations of data governance in any company.

Talend, an open source software provider, announced the availability of its next-generation integration platform, which provides a unified environment for managing the entire lifecycle across data, application and process integration requirements, including big data profiling for Hadoop. The company is a pure-play vendor of open source software, offering a breadth of middleware solutions that address both data management and application integration needs. It makes enterprise-grade data management and application integration solutions available to organizations of all sizes. The platform also supports widely deployed NoSQL databases and includes a set of improvements that increase usability and performance across the entire platform.

Firms can increase IT productivity while lowering costs by creating a federated data virtualization layer of reusable services. This layer removes point-to-point dependencies between data sources and targets, lessening the impact of changing data structures. The company provides applications that accelerate data loading and other aspects of Hadoop setup by enabling developers and analysts to use Hadoop technologies such as Hive, Pig and Sqoop without having to write Hadoop code.
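To make the idea of removing point-to-point dependencies more concrete, here is a minimal Python sketch. It is not Talend code and does not use any Talend API: the VirtualLayer class, the crm_customers adapter, the field names and the sample rows are all hypothetical, invented purely for illustration. The sketch shows a target reading a logical dataset through a shared layer, so that only the source adapter needs to know the source's column names.

```python
from typing import Callable, Dict, Iterable, List

# Canonical record shape shared by every consumer (hypothetical field names).
Record = Dict[str, object]


class VirtualLayer:
    """Hypothetical registry mapping logical dataset names to reader services."""

    def __init__(self) -> None:
        self._readers: Dict[str, Callable[[], Iterable[Record]]] = {}

    def register(self, name: str, reader: Callable[[], Iterable[Record]]) -> None:
        # Registering again under the same name swaps the source behind it.
        self._readers[name] = reader

    def read(self, name: str) -> List[Record]:
        return list(self._readers[name]())


# Raw rows as a source system might expose them (made-up sample data).
CRM_ROWS = [{"cust_id": 1, "cust_nm": "Acme"}, {"cust_id": 2, "cust_nm": "Globex"}]


def crm_customers() -> Iterable[Record]:
    # The adapter is the only place that knows the source's column names.
    for row in CRM_ROWS:
        yield {"id": row["cust_id"], "name": row["cust_nm"]}


def load_names(layer: VirtualLayer) -> List[str]:
    # A target reads the logical dataset and never touches source columns.
    return [str(record["name"]) for record in layer.read("customers")]


layer = VirtualLayer()
layer.register("customers", crm_customers)
print(load_names(layer))
```

If the source system later renames cust_nm, only the crm_customers adapter changes in this sketch; load_names and any other consumer of the "customers" dataset stay untouched.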

The company's two big data integration solutions are Talend Open Studio for Big Data and Talend Platform for Big Data. Talend Open Studio is a versatile set of open source products for developing, testing, deploying and administering data management and application integration projects. Developers gain productivity through an easy-to-use, Eclipse-based graphical environment that combines data integration, data quality, MDM, application integration and big data. Talend Platform for Big Data adds data quality, advanced deployment and management functions across the enterprise. It provides a broad set of tools to help users load, extract and improve disparate data while leveraging the massively parallel processing power of big data technologies, including Apache Hadoop.

“Talend version 5.2 delivers on our vision of simplifying the development, integration and management of big data so that businesses can focus on using that data to make faster and more informed decisions,” said Fabrice Bonan, co-founder and chief technical officer, Talend. “We provide the most powerful and versatile open source, big data solution to help organizations load, extract and improve disparate data while leveraging the massively parallel processing power of big data technologies including Apache Hadoop and leading NoSQL databases.”

With Talend’s support for over 450 connectors and advanced data transformation capabilities, organizations can access applications and data wherever they are stored. Talend connects natively to databases, packaged applications (ERP, CRM, etc.), SaaS and cloud applications, mainframes, files, Web services, data warehouses, data marts and OLAP applications. The big data distributions and NoSQL databases supported by Talend include Hortonworks Data Platform, Apache Hadoop, Cassandra, Cloudera, Google BigQuery, Greenplum, MapR, MongoDB, Teradata and Vertica. Well-known databases supported by Talend include Amazon RDS, Apache HBase, AS400, DB2, Hive, JavaDB, JDBC, Microsoft OLE-DB, Microsoft SQL Server, MySQL, Oracle and PostgreSQL.

The product is available for download.

Raja Rao

Raja Rao is our Cloud Journalist, writing for ToolsJournal on cloud tools, latest updates, cloud quick lists and more. He holds a Bachelor of Engineering in Civil Engineering and has worked in IT for over 15 years, with expertise in CRM, PeopleSoft, SAP, Microsoft and .NET technologies. Raja brings a blend of technical, solutions and delivery expertise. Drawn to the concept of ToolsJournal, he joined us full time and has been a tremendous help in taking this portal forward.

You can reach Raja at 
