Universal Repository Interfacing
Connectors enable automatic content aggregation, facilitating a unified solution across multiple data sources
Enterprises looking to exploit content must be able
to access a wide range of data sources simultaneously: unstructured
data such as HTML pages, word processing documents, spreadsheets and
electronic mail; semi-structured data such as XML; structured data
such as Oracle, Lotus Notes and ODBC-compliant content; and multimedia
data, including audio and video content.
Data Agnostic
Autonomy solutions are not reliant on any single
file or data format. Autonomy handles all types of information and
provides a range of highly scalable components that automatically
aggregate more than 200 different content formats, from the most
comprehensive range of repositories including:
- Document Repositories
- Relational Database Management Systems
- Local File Systems
- Internet Servers
- E-mail Servers
- Newsfeed Servers
- Legacy Systems
Automatic Synchronization
Aggregation is the process of gathering,
extracting and importing content, metadata and security data from
diverse information repositories for analysis by Autonomy's Intelligent
Data Operating Layer. All Connectors keep an audit of the files
aggregated and, optionally, their security entitlements, recording
modifications, deletions and completion points. This audit allows
automatic data synchronization between Autonomy's infrastructure and
the data source.
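As a rough illustration of how an audit can drive synchronization, the sketch below compares the previous audit with the current state of a source and works out what needs to be added, refreshed or removed. It is not Autonomy's implementation: the function name `plan_sync` and the use of modification timestamps as the change signal are assumptions made for the example.

```python
def plan_sync(audit, source):
    """Compare the last audit with the current source listing.

    Both arguments map a document reference to its modification
    timestamp (hypothetical structure, chosen for this sketch).
    Returns (to_add, to_update, to_delete) as sorted reference lists.
    """
    # New in the source: never aggregated before.
    to_add = sorted(ref for ref in source if ref not in audit)
    # Present in both, but modified since the audit was recorded.
    to_update = sorted(
        ref for ref in source
        if ref in audit and source[ref] > audit[ref]
    )
    # Audited earlier but no longer in the source: deleted upstream.
    to_delete = sorted(ref for ref in audit if ref not in source)
    return to_add, to_update, to_delete


if __name__ == "__main__":
    audit = {"a.doc": 100.0, "b.xls": 100.0, "c.html": 100.0}
    source = {"a.doc": 100.0, "b.xls": 150.0, "d.xml": 120.0}
    print(plan_sync(audit, source))
```

After the plan is applied, the audit would be replaced with the current listing so that the next run starts from the new completion point.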
Security
In many cases it is not sufficient simply to
aggregate a conceptual understanding of the information; the user's
security entitlement to view the documents must also be respected.
Where applicable, Autonomy Connectors aggregate not only the
information but also mirror exactly the security entitlement of the
source, so that the right information is delivered to the right people
according to who is entitled to see it.
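The idea of mirrored entitlements can be sketched as follows: each aggregated document carries the set of groups allowed to read it in the source repository, and results are filtered against the groups of the requesting user. The class and function names here are hypothetical, and a group-based model is an assumption; real repositories may use richer permission schemes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IndexedDocument:
    """A document reference plus the entitlement copied from its source."""
    ref: str
    allowed_groups: frozenset  # groups permitted to read the document


def visible_to(documents, user_groups):
    """Return only the documents that at least one of the user's groups may read."""
    user_groups = frozenset(user_groups)
    return [doc for doc in documents if doc.allowed_groups & user_groups]


if __name__ == "__main__":
    docs = [
        IndexedDocument("hr/salaries.xls", frozenset({"hr"})),
        IndexedDocument("wiki/home.html", frozenset({"hr", "staff"})),
    ]
    print([doc.ref for doc in visible_to(docs, {"staff"})])
```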
Entirely Configurable
Autonomy Connectors are entirely configurable to
suit the network architecture and bandwidth limitations: aggregation
can be performed on a scheduled basis, in batch form or as multiple
simultaneous jobs.
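To make the batch and simultaneous-job options concrete, here is a minimal sketch using Python's standard thread pool. The parameter names `batch_size` and `max_jobs` are invented for the example and do not correspond to actual Connector settings; scheduling is left to an external scheduler and is not shown.

```python
from concurrent.futures import ThreadPoolExecutor


def batches(items, size):
    """Yield consecutive slices of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def run_aggregation(refs, fetch, batch_size=100, max_jobs=4):
    """Fetch `refs` in batches, running up to `max_jobs` batches at once.

    `fetch` is a caller-supplied function that takes a list of references
    and returns a list of fetched items. Results keep the input order.
    """
    with ThreadPoolExecutor(max_workers=max_jobs) as pool:
        results = pool.map(fetch, batches(refs, batch_size))
        return [item for batch in results for item in batch]


if __name__ == "__main__":
    def fake_fetch(batch):
        # Stand-in for a real repository call.
        return [ref.upper() for ref in batch]

    print(run_aggregation(["a", "b", "c"], fake_fetch, batch_size=2, max_jobs=2))
```

A smaller `max_jobs` value limits the load placed on a constrained network link, while a larger `batch_size` reduces the number of round trips.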
Benefits
- Facilitates a unified solution across multiple data sources
- Connects people and content automatically
- Reduces labor expenses
- Supports faster, better-informed decisions based on a wider range of information
- Avoids duplicated effort and wasted time
Repositories & Databases Supported
Autonomy can supply standard Connectors (also
referred to as Fetches) for a considerable number of proprietary data
repositories and file formats. Autonomy supports many other document
management systems, repositories and document formats. Contact us to
find out more.
Legacy Compatibility
All of Autonomy's Connectors are capable of
extracting all information contained in the repositories, including,
for example, the metadata stored in database records, file records in
document management systems and meta-information in internet and
intranet pages. Furthermore, Autonomy can integrate with a wide range
of legacy collaboration systems in order to leverage the existing
user-document relationships that reside within the legacy knowledge
base architecture. Once stored within Autonomy's Intelligent Data
Operating Layer, this metadata and the business rules it embodies are
available to all applications built on the layer.
*(1) OMNI Fetch™ is a generic fetch framework
that a customer or partner can use to implement their own Connector.
OMNI Fetch™ allows you to download documents from any type of local or
remote repository. For more details about OMNI Fetch™ please refer to
the OMNI Fetch™ technical brief.
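The general shape of a generic fetch framework can be sketched as an abstract contract that each custom connector fulfils. The code below does not reproduce the OMNI Fetch™ API, which is described in its technical brief; the class names, method names and the in-memory repository are all hypothetical.

```python
from abc import ABC, abstractmethod


class Connector(ABC):
    """Hypothetical contract a custom connector might satisfy."""

    @abstractmethod
    def list_refs(self):
        """Return the references of all documents in the repository."""

    @abstractmethod
    def fetch(self, ref):
        """Return the raw content of one document as bytes."""


class InMemoryConnector(Connector):
    """Toy repository backed by a dict, standing in for a real source."""

    def __init__(self, documents):
        self._documents = dict(documents)

    def list_refs(self):
        return sorted(self._documents)

    def fetch(self, ref):
        return self._documents[ref]


def download_all(connector):
    """Fetch every document exposed by any Connector implementation."""
    return {ref: connector.fetch(ref) for ref in connector.list_refs()}


if __name__ == "__main__":
    repo = InMemoryConnector({"b.txt": b"two", "a.txt": b"one"})
    print(download_all(repo))
```

Because `download_all` depends only on the abstract contract, supporting a new repository type means writing one new subclass rather than changing the surrounding code.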
Import Module
The Import Module handles the import of files
that the Connectors pull from the various data repositories.
The Import Module can perform a range of operations
on the text extracted from documents. Key features include:
- Provides a comprehensive toolkit of parameterized filter operations that can be used to extract relevant information from fields; for example, the extract price operator is designed to make it easier to spider e-commerce websites
- Creates logical fields by aggregating existing fields within the document; for example, a new title can be generated by combining several fields within the source document
- Filtering rules: the minimum and maximum size of imported documents can be specified in terms of the number of words and file size in bytes
- Large documents can be split into multiple sections by paragraph, anchor points or word limits
- Titles can be intelligently generated to avoid repetition
- Summaries can be intelligently extracted
- All existing Metadata can be extracted
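Several of the operations listed above can be approximated in a few lines of code. The sketch below is illustrative only and does not show the Import Module's actual operators or configuration; the function names, the price pattern and the parameter names are assumptions made for the example.

```python
import re

# Assumed price format: a currency symbol followed by digits, with
# optional thousands separators and an optional two-digit fraction.
PRICE_PATTERN = re.compile(r"[$£€]\s?\d+(?:,\d{3})*(?:\.\d{2})?")


def extract_price(text):
    """Return the first price-like string found in `text`, or None."""
    match = PRICE_PATTERN.search(text)
    return match.group(0) if match else None


def compose_title(fields, names, separator=" - "):
    """Build a logical title field from the non-empty named fields."""
    return separator.join(fields[name] for name in names if fields.get(name))


def within_limits(text, min_words=0, max_words=None, max_bytes=None):
    """Apply word-count and byte-size filtering rules to a document."""
    words = len(text.split())
    if words < min_words:
        return False
    if max_words is not None and words > max_words:
        return False
    if max_bytes is not None and len(text.encode("utf-8")) > max_bytes:
        return False
    return True


def split_by_words(text, limit):
    """Split a document into sections of at most `limit` words each."""
    words = text.split()
    return [" ".join(words[i:i + limit]) for i in range(0, len(words), limit)]


if __name__ == "__main__":
    print(extract_price("Now only $1,299.00 while stocks last"))
    print(compose_title({"subject": "Q3 report", "author": "Smith"},
                        ["subject", "author"]))
    print(split_by_words("a b c d e", 2))
```

Splitting by word limit is the simplest of the three splitting strategies mentioned; splitting by paragraph or anchor point would follow the same pattern with a different boundary rule.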