Home   |   Company   |   Services   |   Technology Partners   |   Contract Vehicles   |   News & Events   |   Contact Us
Talend

Talend Overview

Talend Open Studio

Talend's flagship product, Talend Open Studio, is the most open, innovative and powerful data integration solution on the market today.

Provided as a packaged, out-of-the-box, ready-to-install platform, Talend Open Studio meets the data integration requirements of all organizations – regardless of their size or level of data integration expertise.

Talend Open Studio is a robust product that runs complex integration processes even in the most demanding environments.

 

Talend Integration Suite

The leading open source enterprise data integration solution, Talend Integration Suite supports the tough requirements of enterprise development, and scales to the highest levels of data volumes and process complexity.

Talend Integration Suite is a subscription service that extends award winning Talend Open Studio with professional grade technical support and additional features to facilitate the work of large teams and industrialize enterprise-scale deployments.

 

Talend Integration Suite MPx

Based on Talend's award winning enterprise data integration technology, Talend Integration Suite MPx is a highly scalable, massively parallel data integration platform that scales to the highest volumes of data.

Geared toward enterprises that need to process extreme data volumes in ever tightening time windows, Talend Integration Suite MPx exceeds the most demanding requirements and supersedes all existing performance benchmarks.

 

Talend Integration Suite RTx

Based on Talend's award winning enterprise data integration technology, Talend Integration Suite RTx is the real-time data integration platform of choice for enterprise application integration needs.

Talend Integration Suite RTx handles high data throughput on a message-oriented architecture and the recognized standards of real-time integration.

 

Talend On Demand

The industry's first data integration Software as a Service (SaaS), Talend On Demand consolidates Talend Open Studio metadata and project information in an online, shared repository hosted by Talend.

Talend On Demand allows project teams of any size to consolidate their work in a centralized and shared repository, facilitating collaboration, object and code reuse, and promoting development best practices.

Because of the SaaS model, Talend On Demand does not require any configuration or administration. Talend Open Studio users keep using the design and runtime environments they are familiar with, while the installation, the configuration and the backup of the centralized repository are handled remotely by Talend.

 

Talend Open Profiler

Data profiling is the process of examining the data available in existing data sources (e.g. databases, applications, files, etc.) and collecting statistics and information about this data. Data profiling enables the assessment of the quality level of the data contained in the information system, according to a defined set of metrics and goals.

The first open source data profiling tool, Talend Open Profiler, allows business users or data management staff to define a set of indicators for each data element that needs to be analyzed or monitored. It produces sophisticated reports and graphs that let users gauge at a glance the level of quality of the data, and the status of the indicators that were defined.

 

Talend Data Quality

Data quality entails more than helping companies get correct data into their information systems; it also means getting rid of bad, corrupted, or duplicate data. Clean data is key when integrating information across systems, because misinformation can proliferate quickly.

The first open source data quality solution with enterprise-grade features and technical support, Talend Data Quality is a graphical data quality management environment that processes data, such as addresses, phone numbers, spellings, synonyms and abbreviations. Talend Data Quality includes both data profiling and data cleansing capabilities.

 

Talend MDM

Mastering this data is difficult. It resides in multiple sources. Ownership of the data presents political challenges and it is typically a moving target, changing often.

The most complete open source master data management solution, Talend MDM provides Enterprise-grade features to synchronize, quality-assure and provide a single version of master data throughout and across the organization's systems.

Talend Open Studio

Talend Open Studio provides advanced capabilities that dramatically improve the productivity of data integration job design and proven scalability to ensure optimal execution.

Business modeling

Talend Open Studio: Business Modeler

Talend Open Studio's Business Modeler leverages a top-down approach, allowing line-of-business stakeholders to get involved in the design of the integration processes and to monitor development progress. The Business Models are non-technical and business-oriented views built using the convenient library of shapes and links.

The Business Modeler also regroups all relevant documentation supporting the open source data integration, data migration and data synchronization processes in a business-friendly diagram. This is a very efficient way of monitoring the Jobs and performing impact analysis if a problem arises.


Talend On-Demand

Talend On Demand is the industrys first data integration Software as a Service (SaaS).

Talend On Demand consolidates Talend Open Studio metadata and project information in an online, shared repository hosted by Talend. It allows project teams of any size to consolidate data integration work in a centralized and shared repository, facilitating collaboration, object and code reuse, and promoting development best practices.

Teamwork enabled by Talend On Demand enhances the flexibility of the development process and adds value to the overall organization. The Software as a Service model provides the same level of flexibility to both local and distributed project teams, while enforcing strict security for the project metadata.

The following information is stored in the Talend On Demand shared repository:

  • Connection information and data structures for source and target systems (business applications, databases, files, Web Services, etc.)
  • Talend Open Studio Business Models, with their non-technical description of data integration processes and associated project documentation
  • Integration Jobs, including all properties of components and connectors, data transformations, mappings, etc.
  • Administrative information (projects hierarchy, user permissions, etc.)

Talend On Demand leverages Talend Open Studio as its design and runtime environment. It is important to note that no actual enterprise data is stored outside the organizations firewall  only project information is consolidated in the repository. Talend On Demand keeps the actual data safe!

Because it relies on the SaaS model, Talend On Demand does not require any configuration or administration. Talend Open Studio users keep the design and runtime environments they are familiar with, installation configuration and backup of the centralized repository is handled remotely by Talend.

Talend Integration Suite

Talend Integration Suite is the first open source enterprise data integration solution, designed to support multi-user development, and to scale to the highest levels of data volumes and process complexity.

Talend Integration Suite is a subscription service that extends award winning Talend Open Studio with professional grade technical support and additional features to facilitate the work of large teams and industrialize enterprise-scale deployments.

Talend Open Studio is the core of Talend Integration Suite. Its three main applications, Business Modeler, Job Designer, and Metadata Manager, constitute the primary work environment of business users and integration process developers for data integration, data migration or data synchronization jobs.

Teamwork and consolidation of development

Talend Integration Suite: Business Modeler

Talend Integration Suite's Shared Repository is designed to consolidate all project information and enterprise metadata in a centralized repository shared by all stakeholders in the integration processes: business users, job developers, and IT operations staff-all of whom can access the same, single version of the truth. This shared repository facilitates collaboration between team members by allowing them to store and share their business models, data integration jobs, and metadata in an industry-standard source manager (SVN).

This promotes reusability of objects and code, as well as facilitating the design of development best practices that can then be leveraged by all developers for building data integration, data migration and data synchronization jobs.

The Shared Repository features advanced collaboration capabilities that include object-level check-in and check-out, as well as users, roles, permissions & privileges management.


Talend Integration Suite MPx

Based on Talend's award winning enterprise open source data integration technology, Talend Integration Suite MPx is a highly scalable, massively parallel data integration platform that scales to the highest volumes of data.

Geared toward enterprises that need to process extreme data volumes in ever tightening time windows, Talend Integration Suite MPx exceeds the most demanding requirements and supersedes all existing performance benchmarks.

FileScale Technology

Talend Integration Suite: FileScale Technology

Talend Integration Suite MPx features the unique FileScale technology which leverages the execution server hardware architecture and maximizes the performance of low-level sort algorithms.

The FileScale technology works in bulk mode on (very) large files. It takes full advantage of the execution architecture as it is not restricted by the JVM or execution engine limitations typical of traditional data integration architectures.

FileScale technology sorts and transforms data using innovative high-performance mathematical algorithms for data processing. It leverages the MapReduce architecture to automatically break down any data processing operation into a number of granular processes.


Talend Integration Suite RTx

Based on Talend's award winning enterprise open source data integration technology, Talend Integration Suite RTx is the real-time data integration platform of choice for enterprise application integration needs.

Today's companies live in an on-demand world, where data a few-hour old is already obsolete. Using low-latency data integration solutions to process data in real-time, stakeholders can be better informed and thus make better business-critical decisions.

Service-oriented architecture

Talend Integration Suite: open source data integration Service-oriented architecture

Talend Integration Suite RTx provides support for:

  • Data Integration Services: triggering or integrating data integration processes in real-time as the need arises, using Web Services.
  • Data Services: providing an easy and immediate access service to critical data that is usually difficult to access using standard protocols.

The administration console of Talend Integration Suite RTx offers a web-based and fully graphical environment to expose one or more data integration jobs as services (Web Services), enabling their automatic deployment in and across heterogenous applications and systems using SOAP binding (RPC or document-based). A dedicated WSDL wizard helps generate WDSL descriptors to expose Jobs as Web Services and find matching UDDI entries when consuming Web Services.

In addition, Talend Integration Suite RTx provides a native export to JBoss ESB for full interoperability between applications.

The SOA Manager also features an advanced capability of incoming request management based on an optimized pooling and queueing system. The user-defined pool of active services handles a number of requests in real-time, while a queue manager handles the additional requests, buffering the throughput, for an asynchronous processing.


Talend Open Profiler

Data profiling is the process of examining the data available in existing data sources (e.g. databases, applications, files, etc.) and collecting statistics and information about this data. Data profiling enables the assessment of the quality level of the data contained in the information system, according to a defined set of metrics and goals.

Talend Open Profiler is a sophisticated, yet simple-to-use open source data profiling tool that defines the content, structure, and quality of highly complex data structures. The open source data profiler allows business users and data management staff to perform a large variety of analyses using a set of indicators, patterns and rules for each data element being analyzed or monitored. It analyzes data on an ongoing basis, and analyzes changes to source data over time to help improve data quality.

These data quality indicators can range from simple or advanced statistics to text string analysis, including summary data and statistical distributions of records. The patterns are preset or customized expressions that define the expected form of data analyzed and the data quality rules help define custom business thresholds and value ranges.

Talend Open Profiler produces sophisticated reports and graphs that let users gauge at a glance the data quality, and the status of the predefined indicators. In addition an embedded data explorer allows users to directly drill down into the tables of the analyzed databases.

Metadata discovery

Talend Open Profiler: Metadata discovery

Talend Open Profiler connects to databases and files to introspect their structures and stores the description of their metadata in its Metadata Repository. The metadata is then used by data analysts to set up data quality metrics and indicators.

 


Talend Data Quality

Data quality entails more than helping companies get correct data into their information systems; it also means getting rid of bad, corrupted, or duplicate data. Clean data is a key element when integrating information across systems, because misinformation can proliferate quickly - internally of course, but also to business partners. With today's interconnected information systems, poor quality data spreads the same way viruses are spread by travelers: erroneous information can spread quickly to other applications. The cost of compromised data is incalculable, including lost sales, wasted productivity, loss of reputation or goodwill, and missed opportunities

All functionality is completely integrated with Talend Integration Suite, Talend's leading open source enterprise data integration solution, ensuring that data quality is built into the open source integration processes during the design phase.

Data Profiling

Talend Data Quality: open source Data Profiling

The first step in improving the quality of an enterprise's data is to "profile" (data profiling) or evaluate that data. Sophisticated, yet easy to use, the data profiler is an advanced UI-based system that does not require an understanding of database engines and file structures. Business analysts or other non-technical personnel can define a set of indicators, patterns and business rules for each data element that needs to be analyzed or monitored through the open source data profiling tool. These indicators can range from simple or advanced statistics, to pattern and soundex frequencies as well as text string and numeric analysis, including summary data and statistical distributions of records. The patterns are preset or customized expressions that define the expected form of data analyzed and the open source data quality business rules help define custom business thresholds and value ranges.

By reviewing the metrics on a regular basis, and following their evolution and trend, a company can follow the evolution (improvement or degradation) of the quality of its data through data profiling.

Other functionalities include:

  • History of data profiling analyses
  • Batch analyzing
  • Report stylesheet customization
  • Various report formats including PDF, HTML and XML.

 


Talend MDM

Talend MDM is the only solution that provides data integration, data quality, master data and data stewardship functionality on a single platform. It  leverages an active data model that offers the flexibility to model and manage any data.

To meet fundamental strategic objectives organizations must streamline processes, create new revenue opportunities, reduce costs and facilitate company growth.  Business analysts and IT organizations look to data as a key asset and vital component to reaching these objectives.

Talend Master Data Management

Built on open standards, Talend MDM is the first comprehensive solution for Data Integration, Data Quality, Master Data and Data Stewardship built on a single platform. Reducing complexity, it provides the necessary data stewardship tools that help teams efficiently collaborate on master data and meet data governance requirements.

Additionally, Talend's unique, flexible Active Data Model allows organizations to quickly model and master any data domain, not just customers or products, and systematically improve access and reliability across enterprise systems.

Talend MDM is available in two editions. Talend MDM Community Edition is provided under the GPL license and can be downloaded at no charge. Talend MDM Enterprise Edition is provided under a subscription license.

Hundreds of customers have successfully solved their data challenges with Talend's data integration and data quality solutions. Talend MDM presents a natural extension of the Talend product line.  Some of the key functional benefits of using Talend MDM include:

Rapid Implementation

Talend MDM can be implemented immediately to demonstrate and communicate quick, real value from master data management.

  • Complete MDM solution on single platform - Integrated solution for Data Integration, Data Quality, Master Data and Data Stewardship speeds up and simplifies implementation.
  • Data model driven integration - Master data events conditionally drive integration and synchronization with external systems, reducing system complexity and time to deploy.
  • Instant stewardship and governance - Dynamic collaborative MDM interface enables immediate authoring and stewardship of hub data.

Flexible Solution

Master data on your terms

  • Master any data domain - Intuitive data modeling tools allow you to define and master any domain without conforming to rigid, predefined data models.
  • Iterative implementation - XML based, the Active Data Model permits an iterative definition of the data model to gain alignment from business users and ensure adoption upon launch.
  • Incremental production - Changes and the evolution of the MDM data model can be applied in real time to associated services and authoring tools without downtime.

Open Source

  • Implement MDM today - Download Talend MDM Community Edition to start a real MDM project today, identify your requirements, address real architectural concerns and develop a long term MDM strategy
  • Extensibility and community improvement - Use standards based source code to extend or modify Talend MDM or search through a community for production ready extensions and MDM components to implement your unique solution.
  • Talend commitment to open source - Combine the advantages of open source (openness, freedom to modify, and community contributions) with the commitment of a commercial vendor (R&D, QA, continuous improvements, SLA, etc.)

This approach speeds and simplifies MDM implementations and provides flexibility to iterate and expand incremental MDM value over time.