Universal Repository Interfacing

Connectors enable automatic content aggregation, facilitating a unified solution across multiple data sources

Enterprises looking to exploit content must be able to access a wide range of data sources simultaneously. These include unstructured data such as HTML pages, word processing documents, spreadsheets and electronic mail; semi-structured data such as XML; structured data such as Oracle, Lotus Notes and ODBC-compliant content; and multimedia data, including audio and video content.

Data Agnostic

Autonomy solutions are not reliant on any single file or data format. Autonomy handles all types of information and provides a range of highly scalable components that automatically aggregate more than 200 different content formats, from the most comprehensive range of repositories including:

  • Document Repositories
  • Relational Database Management Systems
  • Local File Systems
  • Internet Servers
  • E-mail Servers
  • Newsfeed Servers
  • Legacy Systems

Automatic Synchronization

Aggregation is the process of gathering, extracting and importing content, metadata and security data from diverse information repositories for analysis by Autonomy's Intelligent Data Operating Layer. All Connectors keep an audit of the files aggregated and, optionally, their security entitlements, recording modifications, deletions and completion points. This audit allows automatic data synchronization between Autonomy's infrastructure and the data source.
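As an illustration of how an audit can drive synchronization, the following is a minimal sketch only, not Autonomy's implementation. It assumes the audit is a simple mapping from file path to last-modified timestamp; the function name `diff_against_audit` and the data layout are hypothetical.

```python
def diff_against_audit(audit, current):
    """Compare the previous audit with the repository's current listing.

    Both arguments map a file path to a last-modified timestamp
    (an assumed layout, chosen for illustration).
    Returns (added, modified, deleted) as sorted lists of paths.
    """
    added = sorted(p for p in current if p not in audit)
    deleted = sorted(p for p in audit if p not in current)
    modified = sorted(
        p for p in current if p in audit and current[p] != audit[p]
    )
    return added, modified, deleted


# Hypothetical audit from the last run and a fresh repository listing.
audit = {"a.doc": 100.0, "b.xls": 200.0, "c.html": 300.0}
current = {"a.doc": 100.0, "b.xls": 250.0, "d.xml": 400.0}

added, modified, deleted = diff_against_audit(audit, current)
print(added, modified, deleted)
```

Only the changed paths need to be fetched again or removed from the index, which is what keeps the two sides synchronized without a full re-aggregation.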

Security

In many cases it is not sufficient simply to aggregate a conceptual understanding of the information; the user's security entitlement to view the documents must also be respected. Where applicable, Autonomy Connectors not only aggregate the information but also exactly mirror the security entitlement required, delivering the right information to the right people according to who is entitled to see it.
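The idea of respecting mirrored entitlements can be sketched as follows. This is a hedged illustration, assuming each aggregated document carries a list of groups allowed to view it; the field name `allowed_groups` and the function `visible_documents` are hypothetical and do not describe Autonomy's actual security model.

```python
def visible_documents(documents, user_groups):
    """Return only the documents the user is entitled to see.

    A document is visible when the user belongs to at least one of
    the groups mirrored from the source repository.
    """
    groups = set(user_groups)
    return [doc for doc in documents if groups & set(doc["allowed_groups"])]


# Hypothetical aggregated documents with mirrored entitlements.
docs = [
    {"title": "Payroll", "allowed_groups": ["hr"]},
    {"title": "Handbook", "allowed_groups": ["hr", "staff"]},
]

staff_view = visible_documents(docs, ["staff"])
print([doc["title"] for doc in staff_view])
```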

Entirely Configurable

Autonomy Connectors are entirely configurable. Depending on the network architecture and bandwidth limitations, aggregation can be performed on a scheduled basis, in batch form or as multiple simultaneous jobs.
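The difference between batch and simultaneous aggregation can be sketched with Python's standard `concurrent.futures` module. This is an assumption-laden illustration: the `run_jobs` function and its `max_simultaneous` parameter are hypothetical names, and each job is modeled as a plain callable rather than a real Connector.

```python
from concurrent.futures import ThreadPoolExecutor


def run_jobs(jobs, max_simultaneous=1):
    """Run aggregation jobs and return their results in submission order.

    With max_simultaneous=1 the jobs run one after another (batch style);
    a larger value lets several jobs run at the same time.
    """
    with ThreadPoolExecutor(max_workers=max_simultaneous) as pool:
        return list(pool.map(lambda job: job(), jobs))


# Hypothetical jobs standing in for Connector runs.
jobs = [lambda: "file system done", lambda: "mail server done"]

print(run_jobs(jobs, max_simultaneous=2))
```

Lowering `max_simultaneous` is one way to limit load on a constrained network; scheduling itself would be handled by whatever triggers `run_jobs`.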

Benefits

  • Facilitates a unified solution across multiple data sources
  • Connects people and content automatically
  • Reduces labor expenses
  • Enables faster, better-informed decisions based on a wider range of information
  • Avoids duplication of effort and wasted time

Repositories & Databases Supported

Autonomy can supply standard Connectors (also referred to as Fetches) for a considerable number of proprietary data repositories and file formats. Autonomy supports many other document management systems, repositories and document formats. Contact us to find out more.

Legacy Compatibility

All of Autonomy's Connectors are capable of extracting all information contained in the repositories, including, for example, the metadata stored in database records, file records in document management systems and meta-information in internet and intranet pages. Furthermore, Autonomy can integrate with a wide range of legacy collaboration systems in order to leverage the existing user-document relationships that reside within the legacy knowledge base architecture. Once this metadata is stored within Autonomy's Intelligent Data Operating Layer, all applications built on the layer can take advantage of it and of the business rules it embodies.

*(1) OMNI Fetch™ is a generic fetch framework that a customer or partner can use to implement their own Connector. OMNI Fetch™ allows you to download documents from any type of local or remote repository. For more details, please refer to the OMNI Fetch™ technical brief.

Import Module

The import process is facilitated by the Import Module, which handles the importation of files that are pulled out of the various data repositories by the Connectors.

The Import Module can execute various operations on the text that is extracted from the documents. Key features include:

  • Provides a comprehensive tool kit of parameterized filter operations that can be used to extract relevant information from fields. For example, the extract price operator is designed to make it easier to spider e-commerce websites
  • Creates logical fields by aggregating existing fields within the document, for example generating a new title by combining several fields within the source document
  • Filtering rules: the minimum and maximum size of imported documents can be specified in terms of the number of words and the file size in bytes
  • Large documents can be split into multiple sections by paragraph, anchor points or word limits
  • Titles can be intelligently generated to avoid repetition
  • Summaries can be intelligently extracted
  • All existing Metadata can be extracted
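
Three of the operations listed above (size filtering, splitting by word limit and building a logical field) can be sketched as follows. This is a minimal illustration under stated assumptions, not the Import Module's actual interface: all function names, parameters and the dictionary-based document layout are hypothetical.

```python
def within_limits(text, min_words, max_words):
    """Filtering rule: accept a document only if its word count is in range."""
    count = len(text.split())
    return min_words <= count <= max_words


def split_by_word_limit(text, limit):
    """Split a large document into sections of at most `limit` words."""
    words = text.split()
    return [" ".join(words[i:i + limit]) for i in range(0, len(words), limit)]


def combine_fields(document, field_names, separator=" - "):
    """Build a logical field (for example a title) from existing fields.

    Fields that are missing or empty are skipped.
    """
    return separator.join(
        document[name] for name in field_names if document.get(name)
    )


# Hypothetical source document with separate fields.
product = {"brand": "Acme", "model": "X1", "sku": ""}

print(within_limits("one two three", 2, 5))
print(split_by_word_limit("a b c d e", 2))
print(combine_fields(product, ["brand", "model", "sku"]))
```

Splitting by paragraph or anchor points would follow the same pattern, with a different rule for where each section ends.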
 
