75 DBMS_HADOOP
The DBMS_HADOOP
package provides a PL/SQL procedure called CREATE_EXTDDL_FOR_HIVE()
, that creates an Oracle external table for a given hive table.
Big Data SQL needs to be correctly set up for DBMS_HADOOP
to work.
This chapter contains the following topics:
75.1 DBMS_HADOOP Overview
The DBMS_HADOOP package provides two procedures for creating an Oracle external table and for synchronizing the Oracle external table partitions.
These procedures are:
-
CREATE_EXTDDL_FOR_HIVE()
— creates an Oracle external table for a given hive table -
SYNCHRONIZE_PARTITIONS_FOR_HIVE()
— helps to synchronize the Oracle external table partitions with those in the corresponding hive table
75.2 DBMS_HADOOP Security Model
Users must have ALTER
privileges on the table for modifying external table parameters. In addition the ALTER
privileges, users must have READ
privileges for the directory object that contains the external data source and WRITE
privileges for the directory objects containing bad, log, and discard files.
Similar privileges are applicable to the partitioned external tables.
75.3 Summary of DBMS_HADOOP Subprograms
DBMS_HADOOP
includes the CREATE_EXTDDL_FOR_HIVE
procedure and SYNC_PARTITIONS_FOR_HIVE
procedure subprograms.
Table 75-1 DBMS_HADOOP Subprograms
Subprogram | Description |
---|---|
Given a hive table name, creates a text of DDL that can be executed to create an external table corresponding to the hive table |
|
Merges a given partitioned Hive table with an Oracle partitioned table, and replaces it with the merged table. |
|
Synchronizes the existing partitioning definition of a hive table in Oracle catalog |
75.3.1 CREATE_EXTDDL_FOR_HIVE Procedure
This procedure creates an Oracle external table for a given hive table.
Syntax
DBMS_HADOOP.CREATE_EXTDDL_FOR_HIVE ( cluster_id IN VARCHAR2, db_name IN VARCHAR2 := NULL, hive_table_name IN VARCHAR2, hive_partition IN BOOLEAN, table_name IN VARCHAR2 := NULL, perform_DDL IN BOOLEAN := FALSE, text_of_DDL OUT CLOB);
Parameters
Table 75-2 CREATE_EXTDDL_FOR_HIVE Procedure Parameters
Parameter | Description |
---|---|
|
Hadoop cluster ID |
|
Database where the hive table is located |
|
Name of the hive table |
|
If this argument is If the original hive table is not partitioned, |
|
Name of the Oracle external table |
|
If this argument is |
|
If the argument |
75.3.2 CREATE_HYBRID_PARTNED_TABLE Procedure
This procedure merges the partitioned Hive table with an Oracle partitioned table, and replace it with the merged table.
Syntax
DBMS_HADOOP.CREATE_HYBRID_PARTNED_TABLE ( cluster_id IN VARCHAR2, db_name IN VARCHAR2, hive_table_name IN VARCHAR2, table_name IN VARCHAR2, table_owner IN VARCHAR2, perform_ddl IN BOOLEAN := TRUE, text_of_ddl OUT CLOB);
Parameters
Table 75-3 CREATE_HYBRID_PARTNED_TABLE Procedure Parameters
Parameter | Description |
---|---|
|
Hadoop cluster ID |
|
Database where the partitioned Hive table is located |
|
Name of the partitioned Hive table |
|
Name of the partitioned Oracle table |
|
The owner of the partitioned Oracle table |
|
If the value of this parameter is If the value of this parameter is |
|
If the argument |
75.3.3 SYNCHRONIZE_PARTITIONS_FOR_HIVE Procedure
This procedure synchronizes the Oracle external table partitions with those in the corresponding hive table.
Syntax
DBMS_HADOOP.SYNCHRONIZE_PARTITIONS_FOR_HIVE ( table_name IN VARCHAR2, table_owner IN VARCHAR2);
Parameters
Table 75-4 SYNCHRONIZE_PARTITIONS_FOR_HIVE Procedure Parameters
Parameter | Description |
---|---|
|
Oracle external table |
|
Schema name |