Integrated Development Environment
Centerprise 5 features a professional integration environment with extensive
drag and drop functionality and powerful development and debugging features. Key
IDE features include:
- Intuitive, clutter-free user interface that is continuously refined to
provide a natural flow and a familiar look and feel.
- Drag and drop user interface with single click operations for creating data
sources and destinations, auto mapping, and auto field creation.
- Unlimited undo/redo, cut/copy/paste, and other features common in
professional software products
- Use of context menus to provide handy access to relevant commands and
shortcuts
- Team-friendly project structure featuring integration with version control
to facilitate collaborative development
Data Transformation
- Define complex data flows using the full complement of built-in field and
record-level transformations. Built-in transformations include lookups,
expressions, functions, join, sort, merge, union, normalize, denormalize, route,
filter, and others.
- Single click WYSIWYG data preview and quick profile features enable testing and
debugging of dataflows.
- Define subflows to create reusable transformations that can be used in other
dataflows and subflows.
- Database write strategies provide fine-grained control over database writes.
Built-in strategies include diff processing, incremental update, slowly changing
dimensions, and rule-based.
- Add custom data sources, destinations, and transformations using the
extensibility APIs.
Data Quality and Profiling
Centerprise integrates data integration, data quality, and profiling features in
a single environment helping creation of data integration jobs with built-in
profiling, quality measurement, and data cleansing. Profiling and data quality
features include:
- Data Quality mode provides the functionality to track record level errors and
route or process records based on errors.
- Validation Rules transformations can be used to validate data against business
rules and add errors to individual records.
- Data profile transformations provide aggregate statistics and record level error
information.
Job Flow, Scheduling and Execution
- Visual workflow designer that provides the functionality for defining task
flows, branching, and dependencies. Built-in workflow tasks include FTP upload
and download, file system actions, send mail, run programs, execute SQL scripts,
and the ability to run dataflows and workflows.
- Ability to restart jobs from the failure point.
- Extensive parameterization features to facilitate deployment and reusability.
- Built-in scheduler provides the ability to trigger jobs based on a specific
schedule (hourly, daily, weekly, and monthly). Jobs can also be triggered using
file drops and API calls.
Extensibility and Openness
- .Net APIs to add custom functions, sources, destinations, and transformations
- APIs to interact with a server for starting, monitoring, and cancelling jobs.
- All system documents including dataflows, workflows, data models, and others are
stored as standard XML
Performance and Scalability
- High-performance parallel processing engine that takes full advantage of
multicore and multiprocessor hardware to deliver superior performance and
scalability
- Server performance scales virtually in direct proportion to processing power
available. Doubling the number of CPUs or cores usually results in doubling the
throughput.
- Native bulk load support for popular databases is designed to efficiently
process very high data volumes