5 SQL/JSON Conditions IS JSON and IS NOT JSON
SQL/JSON conditions is json
and is not json
are complementary. They test whether their argument is syntactically correct, that is, well-formed, JSON data. You can use them in a CASE
expression or the WHERE
clause of a SELECT
statement.
If the argument is syntactically correct then is json
returns true and is not json
returns false. If the argument cannot be evaluated for some reason (for example, if an error occurs during parsing) then the data is considered to not be well-formed: is json
returns false; is not json
returns true. Well-formed data means syntactically correct data. JSON data can be well-formed in two senses, referred to as strict and lax syntax.
See Also:
Oracle Database SQL Language Reference for information about is json
and is not json
.
- Unique Versus Duplicate Fields in JSON Objects
By default, field names need not be unique for a given JSON object. But you can specify that particular JSON data is to be considered well-formed only if none of its objects have duplicate field names. - About Strict and Lax JSON Syntax
The Oracle default syntax for JSON is lax. In particular: it reflects the JavaScript syntax for object fields; the Boolean and null values are not case-sensitive; and it is more permissive with respect to numerals, whitespace, and escaping of Unicode characters. - Specifying Strict or Lax JSON Syntax
The default JSON syntax for Oracle Database is lax. Strict or lax syntax matters only for SQL/JSON conditionsis json
andis not json
. All other SQL/JSON functions and conditions use lax syntax for interpreting input and strict syntax when returning output.
Related Topics
Parent topic: Store and Manage JSON Data
5.1 Unique Versus Duplicate Fields in JSON Objects
By default, field names need not be unique for a given JSON object. But you can specify that particular JSON data is to be considered well-formed only if none of its objects have duplicate field names.
The JSON standard does not specify whether field names must be unique for a given JSON object. This means that, a priori, a well-formed JSON object can have multiple members that have the same field name. This is the default behavior for handling JSON data in Oracle Database because checking for duplicate names takes additional time.
You can specify that particular JSON data is to be considered well-formed only if all of the objects it contains have unique field names, that is, no object has duplicate field names. You do this by using the keywords WITH UNIQUE KEYS
with SQL/JSON condition is json
.Foot 1
If you do not specify UNIQUE KEYS
, or if you use the keywords WITHOUT UNIQUE KEYS
, then objects can have duplicate field names and still be considered well-formed.
The evaluation that Oracle Database employs always uses only one of the object members that have a given field name; any other members with the same field name are ignored. It is unspecified which of multiple such members is used.
Whether duplicate field names are allowed in well-formed JSON data is orthogonal to whether Oracle uses strict or lax syntax to determine well-formedness.
Parent topic: SQL/JSON Conditions IS JSON and IS NOT JSON
5.2 About Strict and Lax JSON Syntax
The Oracle default syntax for JSON is lax. In particular: it reflects the JavaScript syntax for object fields; the Boolean and null values are not case-sensitive; and it is more permissive with respect to numerals, whitespace, and escaping of Unicode characters.
Standard ECMA-404, the JSON Data Interchange Format, and ECMA-262, the ECMAScript Language Specification, define JSON syntax.
According to these specifications, each JSON field and each string value must be enclosed in double quotation marks ("
). Oracle supports this strict JSON syntax, but it is not the default syntax.
In JavaScript notation, a field used in an object literal can be, but need not be, enclosed in double quotation marks. It can also be enclosed in single quotation marks ('
). Oracle also supports this lax JSON syntax, and it is the default syntax.
In addition, in practice, some JavaScript implementations (but not the JavaScript standard) allow one or more of the following:
-
Case variations for keywords
true
,false
, andnull
(for example,TRUE
,True
,TrUe
,fALSe
,NulL
). -
An extra comma (
,
) after the last element of an array or the last member of an object (for example,[a, b, c
,
]
,{a:b, c:d
,
}
). -
Numerals with one or more leading zeros (for example,
0042.3
). -
Fractional numerals that lack
0
before the decimal point (for example,.14
instead of0.14
). -
Numerals with no fractional part after the decimal point (for example,
342.
or1.e27
). -
A plus sign (
+
) preceding a numeral, meaning that the number is non-negative (for example,+1.3
).
This syntax too is allowed as part of the Oracle default (lax) JSON syntax. (See the JSON standard for the strict numeral syntax.)
In addition to the ASCII space character (U+0020), the JSON standard defines the following characters as insignificant (ignored) whitespace when used outside a quoted field or a string value:
-
Tab, horizontal tab (
HT
,^I
, decimal 9, U+0009,\t
) -
Line feed, newline (
LF
,^J
, decimal 10, U+000A,\n
) -
Carriage return (
CR
,^M
, decimal 13, U+000D,\r
)
The lax JSON syntax, however, treats all of the ASCII control characters (Control+0 through Control+31), as well as the ASCII space character (decimal 32, U+0020), as (insignificant) whitespace characters. The following are among the control characters:
-
Null (
NUL
,^@
, decimal 0, U+0000,\0
) -
Bell (
NEL
,^G
, decimal 7, U+0007,\a
) -
Vertical tab (
VT
,^K
, decimal 11, U+000B) -
Escape (
ESC
,^[
, decimal 27, U+001B,\e
) -
Delete (
DEL
,^?
, decimal 127, U+007F)
An ASCII space character (U+0020) is the only whitespace character allowed, unescaped, within a quoted field or a string value. This is true for both the lax and strict JSON syntaxes.
For both strict and lax JSON syntax, quoted object field and string values can contain any Unicode character, but some of them must be escaped, as follows:
-
ASCII control characters are not allowed, except for those represented by the following escape sequences:
\b
(backspace),\f
(form feed),\n
(newline, line feed),\r
(carriage return), and\t
(tab, horizontal tab). -
Double quotation mark (
"
), slash (/
), and backslash (\
) characters must also be escaped (preceded by a backslash):\"
,\/
, and\\
, respectively.
In the lax JSON syntax, an object field that is not quoted can contain any Unicode character except whitespace and the JSON structural characters — left and right brackets ([
, ]
) and curly braces ({
, }
), colon (:
), and comma (,
), but escape sequences are not allowed.
Any Unicode character can also be included in a name or string by using the ASCII escape syntax \u
followed by the four ASCII hexadecimal digits that represent the Unicode code point.
Note that other Unicode characters that are not printable or that might appear as whitespace, such as a no-break space character (U+00A0), are not considered whitespace for either the strict or the lax JSON syntax.
Table 5-1 shows some examples of JSON syntax.
Table 5-1 JSON Object Field Syntax Examples
Example | Well-Formed? |
---|---|
|
Lax and strict: yes. Space characters are allowed. |
|
Lax (and strict): no. Whitespace characters, including space characters, are not allowed in unquoted names. |
|
Lax and strict: yes. Escape sequence for tab character is allowed. |
|
Lax and strict: no. Unescaped tab character is not allowed. Space is the only unescaped whitespace character allowed. |
|
Lax and strict: yes. Escaped double quotation marks are allowed, if name is quoted. |
|
Lax and strict: no. Name must be quoted. |
|
Lax: yes, strict: no. Single-quoted names (object fields and strings) are allowed for lax syntax only. Escaped double quotation mark is allowed in a quoted name. |
|
Lax and strict: yes. Any Unicode character is allowed in a quoted name. This includes whitespace characters and characters, such as colon ( |
|
Lax (and strict): no. Structural characters are not allowed in unquoted names. |
Related Topics
See Also:
-
IETF RFC4627 and ECMA 404 for the syntax of JSON Data Interchange Format
-
ECMA International and JSON.org for more information about JSON and JavaScript
Parent topic: SQL/JSON Conditions IS JSON and IS NOT JSON
5.3 Specifying Strict or Lax JSON Syntax
The default JSON syntax for Oracle Database is lax. Strict or lax syntax matters only for SQL/JSON conditions is json
and is not json
. All other SQL/JSON functions and conditions use lax syntax for interpreting input and strict syntax when returning output.
If you need to be sure that particular JSON input data has strictly correct syntax, then check it first using is json
or is not json
.
You specify that data is to be checked as strictly well-formed according to the JSON standard by appending (STRICT)
(parentheses included) to an is json
or an is not json
expression.
Example 5-1 illustrates this. It is identical to Example 4-1 except that it uses (STRICT)
to ensure that all data inserted into the column is well-formed according to the JSON standard.
See Also:
Oracle Database SQL Language Reference for information about CREATE TABLE
Example 5-1 Using IS JSON in a Check Constraint to Ensure JSON Data is Strictly Well-Formed (Standard)
CREATE TABLE j_purchaseorder
(id VARCHAR2 (32) NOT NULL PRIMARY KEY,
date_loaded TIMESTAMP (6) WITH TIME ZONE,
po_document VARCHAR2 (32767)
CONSTRAINT ensure_json CHECK (po_document IS JSON (STRICT)));
Related Topics
Parent topic: SQL/JSON Conditions IS JSON and IS NOT JSON
Footnote Legend
Footnote 1: An object field is sometimes called an object “key”.