Oracle® R Enterprise User's Guide
Release 11.2 for Linux, Solaris, AIX, and Windows

Part Number E26499-05

1 Overview of Oracle R Enterprise

R is an open source statistical programming language and environment. For information about R, see the R Project for Statistical Computing at http://www.r-project.org.

R provides an environment for statistical computing and graphics.

R's rapid adoption has earned it a reputation as a new statistical software standard.

Oracle R Enterprise is a component of the Oracle Advanced Analytics Option of Oracle Database Enterprise Edition. For detailed information about Oracle R Enterprise, including links to software downloads, go to Oracle R Enterprise at http://www.oracle.com/technetwork/database/options/advanced-analytics/r-enterprise/index.html.

Oracle R Enterprise lets users perform statistical analysis on data stored in tables in an Oracle Database. Its components are introduced in Oracle R Enterprise Architecture later in this chapter and described in detail in Chapter 3.

Oracle R Connector for Hadoop is a related product that is part of the Big Data Appliance.

Oracle R Enterprise also includes functions that perform the most common, or base, statistical procedures; see Chapter 4 for more information.

The rest of this chapter describes Oracle R Enterprise Architecture, Oracle R Enterprise Data Types, and Oracle R Enterprise Supported Configurations.

Oracle R Enterprise Architecture

Oracle R Enterprise has three components, listed below. The related connector for Hadoop is covered separately in Oracle R Connector for Hadoop.

[Illustration: oreug_vm_001.png]

  1. The Client R Engine is a collection of R packages that allows you to connect to an Oracle Database and to interact with data in that database.

    You can use any R commands from the client. In addition, the client provides these capabilities:

    • The R SQL Transparency framework, which intercepts R functions for scalable in-database execution

    • Interception of data transforms, statistical functions, and Oracle R Enterprise-specific functions

    • Interactive display of graphical results and flow control as in open source R

    • Submission of R closures (functions) for execution in the Oracle Database

  2. The Server is a collection of PL/SQL procedures and libraries that augment Oracle Database with the capabilities required to support an Oracle R Enterprise client. The R engine is also installed on Oracle Database to support embedded R execution. Oracle Database spawns R engines, which can provide data parallelism.

    The Oracle R Enterprise Database engine provides this functionality:

    • Scaling to large datasets

    • Access to tables, views, and external tables in the database, as well as those accessible through database links

    • Use of SQL query parallel execution

    • Use of in-database statistical and data mining functionality

  3. R Engines spawned by Oracle Database support database-managed parallelism and provide lights-out scheduled execution of R scripts, that is, scheduling or triggering R scripts packaged inside a PL/SQL or SQL query. Oracle R Enterprise provides efficient data transfer to and from the spawned engines. Embedded R execution can be used to emulate MapReduce-style programming.
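
The partition, apply, and combine pattern behind MapReduce-style programming can be sketched in base R alone. The following is a minimal illustration, not Oracle R Enterprise code: the `sales` data frame, its columns, and the `map_fn` helper are hypothetical, and everything runs in a single local R session. Because each partition is processed independently, this is the kind of work that database-spawned R engines could carry out in parallel.

```r
# Hypothetical sample data standing in for a database table.
sales <- data.frame(
  region = c("east", "west", "east", "north", "west"),
  amount = c(10, 20, 30, 40, 50)
)

# Partition step: one data frame per region.
partitions <- split(sales, sales$region)

# Map step: the same function runs independently on every partition.
map_fn <- function(part) {
  data.frame(region = part$region[1], total = sum(part$amount))
}
mapped <- lapply(partitions, map_fn)

# Reduce step: combine the per-partition results into a single data frame.
result <- Reduce(rbind, mapped)
print(result)
```

Here `result` holds one row per region with its summed amount. Only the map step depends on a single partition, which is why that step is the natural candidate for parallel execution.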

There are several data types specific to Oracle R Enterprise; see Oracle R Enterprise Data Types for details.

Oracle R Connector for Hadoop

Oracle R Connector for Hadoop (ORCH) is an R package that provides an interface between the local R environment and Hadoop. You install and load this package just as you would any other R package. Using R functions, you can copy data between R memory, the local file system, and HDFS. You can schedule R programs to execute as Hadoop MapReduce jobs and return the results to any of those locations.

ORCH is preinstalled on Oracle Big Data Appliance, but it is licensed separately as one of the Oracle Big Data Connectors. You can also install ORCH on a Hadoop cluster other than one on an Oracle Big Data Appliance.

For information about ORCH, see the Oracle Big Data Connectors User's Guide (http://docs.oracle.com/cd/E27101_01/doc.10/e27365/toc.htm), part of the Oracle Big Data Documentation library (http://docs.oracle.com/cd/E27101_01/index.htm).

Oracle R Enterprise Data Types

Oracle R Enterprise introduces variants of many R data types. The name of each Oracle R Enterprise data type is the name of the corresponding R data type prefixed by ore (for example, ore.factor for an R factor). These data types establish a mapping between an R object and a database table or view. The mapping tracks metadata of the Oracle object, which in turn aids in SQL query generation. These data types form the foundation of the Oracle R Enterprise transparency layer.

The following R data types have been overloaded for transparent in-database execution:

  • Character, Integer, Numeric and Logical vectors

  • Factors

  • Data Frame

  • Matrix is overloaded in two situations:

    • Linear algebra cross-products

    • Creating input matrices for advanced analytics
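
As a rough picture of how a proxy type can track table metadata and turn R operations into SQL, consider the toy sketch below. It is written in base R and is not how Oracle R Enterprise is implemented; the `toy_frame` class, the `to_sql` helper, and the `EMP` table with its columns are all hypothetical names chosen for illustration.

```r
# Toy proxy object (hypothetical): stores table metadata only, never the rows.
toy_frame <- function(table, columns) {
  structure(list(table = table, columns = columns), class = "toy_frame")
}

# Overload column selection, x[, cols], so that it narrows the tracked
# metadata instead of copying any data. Row filters are outside this sketch.
`[.toy_frame` <- function(x, i, j, ...) {
  if (!missing(i)) stop("row selection is not supported in this sketch")
  meta <- unclass(x)
  cols <- if (missing(j)) meta$columns else j
  unknown <- setdiff(cols, meta$columns)
  if (length(unknown) > 0) {
    stop("unknown column(s): ", paste(unknown, collapse = ", "))
  }
  toy_frame(meta$table, cols)
}

# Generate the SQL text implied by the tracked metadata.
to_sql <- function(x) {
  meta <- unclass(x)
  paste0("SELECT ", paste(meta$columns, collapse = ", "), " FROM ", meta$table)
}

emp <- toy_frame("EMP", c("EMPNO", "ENAME", "SAL"))
query <- to_sql(emp[, c("ENAME", "SAL")])
print(query)
```

The familiar `emp[, cols]` syntax never touches any rows; it only updates the metadata from which the query string is built. That is the sense in which overloaded types let ordinary R code stand in for SQL.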

For more information and examples, see Oracle R Enterprise Transparency Framework.

Oracle R Enterprise Supported Configurations

Oracle R Enterprise consists of a client and a server. The client and the server run on Microsoft Windows (32-bit and 64-bit), Oracle Linux, Red Hat Linux, Solaris, or IBM AIX. The server is installed in an Oracle Database, to which the client connects.

Oracle R Enterprise also runs on Oracle Exadata machines with the Linux or Solaris operating system and on SPARC SuperCluster. For details, see Prerequisites.

Installation of Oracle R Enterprise is described in Chapter 2.