Brackit

A powerful JSONiq engine for querying JSON and XML

Use it standalone like jq, or embed it in your data store

Why Brackit?

Two ways to use it:

Command-line tool (bjq) - Like jq, but with FLWOR expressions, joins, and user-defined functions
Embeddable query engine - Add JSONiq queries to your data store with automatic optimizations

# Query JSON from the command line
echo '{"users": [{"name": "Alice", "age": 30}, {"name": "Bob", "age": 25}]}' | \
  bjq 'for $u in $$.users[] where $u.age > 26 return $u.name'

"Alice"

Performance

bjq native binary (Oracle GraalVM, PGO, G1 GC, -O3) vs jq 1.7 — wall-clock, median of 3 runs:

1M records (173 MB flat array):

Query	bjq	jq	Speedup
Filter (`age > 40 and active`)	2.4s	3.7s	1.6x
Group by dept + 3 aggregates	1.4s	7.0s	4.9x
Group by 2 keys + sort	1.5s	6.0s	3.9x
Hash-join (1M customers × 5M orders)	44.4s	— (O(n·m), skipped)	—
Join + group + aggregate + sort	14.7s	— (skipped)	—
5-way aggregation	1.7s	4.9s	2.9x
String equality filter	2.0s	3.6s	1.8x
Top-N (order + slice)	1.2s	5.8s	4.8x
Compound AND filter	2.8s	4.0s	1.4x
Count distinct	1.5s	5.9s	3.9x
Multi-key group + top-N	5.8s	39.6s	6.8x

100k records (18 MB):

Query	bjq	jq	Speedup
Filter	314ms	376ms	1.2x
Group by dept + 3 aggregates	249ms	634ms	2.5x
Group by 2 keys + sort	249ms	543ms	2.2x
5-way aggregation	205ms	466ms	2.3x
Top-N	240ms	517ms	2.2x
Count distinct	184ms	496ms	2.7x
Multi-key group + top-N	968ms	3.2s	3.3x

Reproducible: ./examples/benchmark.sh --sizes "10000 100000 1000000" (or benchmark-3way.sh for native vs JAR vs jq).

Scaling: 1M → 100M → 1B records — 9 query shapes × 3 engines

Compact 48-byte schema ({"age":N,"dept":"X","city":"X","active":T}), Linux, 20 cores, NVMe SSD. Native binary built with PGO trained on all 9 query shapes, -H:+VectorAPISupport, G1 GC, -O3. JAR run with HotSpot JIT.

1M records (50 MB):

Query	native + PGO	JAR (JVM)	jq
filter-count	503ms	603ms	1.5s
group-by dept	718ms	657ms	3.8s
group-by 2 keys	828ms	711ms	5.4s
filter + group-by	727ms	649ms	2.7s
count distinct	638ms	584ms	3.6s
sum(age)	475ms	561ms	1.3s
avg(age)	473ms	577ms	1.3s
min(age)	469ms	550ms	1.2s
max(age)	467ms	563ms	1.2s

100M records (4.9 GB):

Query	native + PGO	JAR (JVM)	jq
filter-count	1.5s	1.5s	OOM
group-by dept	1.7s	2.9s	OOM
group-by 2 keys	1.6s	2.6s	OOM
filter + group-by	1.6s	2.7s	OOM
count distinct	1.6s	2.8s	OOM
sum(age)	1.5s	1.7s	OOM
avg(age)	1.4s	1.7s	OOM
min(age)	1.5s	1.6s	OOM
max(age)	1.5s	1.6s	OOM

1B records (49 GB):

Query	native + PGO	JAR (JVM)	jq
filter-count	13.6s	13.4s	OOM
group-by dept	14.7s	14.8s	OOM
group-by 2 keys	14.5s	14.7s	OOM
filter + group-by	15.1s	15.2s	OOM
count distinct	14.5s	15.3s	OOM
sum(age)	13.1s	13.4s	OOM
avg(age)	13.0s	13.5s	OOM
min(age)	13.4s	13.7s	OOM
max(age)	13.4s	13.4s	OOM

jq OOMs at 100M records (exit 137, OOM-killed by the kernel). Both Brackit paths scale linearly: 1M to 100M is ~3x time for 100x data (cache warm-up + per-thread mmap), 100M to 1B is ~9x time for 10x data — at 1B we're at ~3.4 GB/s, approximately raw NVMe sequential read speed. Pick native for CLI/startup latency, JVM when embedding Brackit as a library.

Reproducible: examples/Gen1B.java (compact 48-byte generator, ~250 MB/s) + examples/benchmark-1b.sh.

Quick Start

Option 1: Native Binary (fastest)

Download the pre-built binary for your platform:

# Linux (x86-64)
curl -L https://github.com/sirixdb/brackit/releases/latest/download/bjq-linux-amd64 -o bjq
chmod +x bjq
sudo mv bjq /usr/local/bin/

# Linux (ARM64)
curl -L https://github.com/sirixdb/brackit/releases/latest/download/bjq-linux-arm64 -o bjq
chmod +x bjq
sudo mv bjq /usr/local/bin/

# macOS (Apple Silicon)
curl -L https://github.com/sirixdb/brackit/releases/latest/download/bjq-macos-arm64 -o bjq
chmod +x bjq
sudo mv bjq /usr/local/bin/

# macOS (Intel)
curl -L https://github.com/sirixdb/brackit/releases/latest/download/bjq-macos-amd64 -o bjq
chmod +x bjq
sudo mv bjq /usr/local/bin/

# Windows (x86-64) - download bjq-windows-amd64.exe from GitHub Releases

Then use it:

echo '{"name": "Alice"}' | bjq '$$.name'

# FLWOR expressions - the killer feature!
bjq 'for $u in $$.users[] where $u.age > 21 order by $u.name return $u' data.json

Option 2: Java Jar

Requires Java 25 or later. Download the jar from GitHub Releases, then:

alias bjq='java --enable-preview --add-modules=jdk.incubator.vector -jar /path/to/bjq-jar-with-dependencies.jar'
bjq 'for $u in $$.users[] where $u.age > 21 return $u' data.json

Option 3: Build from Source

Requires Java 25 or later.

git clone https://github.com/sirixdb/brackit.git
cd brackit
mvn package

# Set up bjq alias
alias bjq='java --enable-preview --add-modules=jdk.incubator.vector -jar '$(pwd)'/target/bjq-jar-with-dependencies.jar'

# Try it out - FLWOR with grouping!
echo '[{"cat":"A","v":1},{"cat":"B","v":2},{"cat":"A","v":3}]' | \
  bjq 'for $x in $$[] group by $c := $x.cat return {$c: sum($x.v)}'

Features at a Glance

Feature	Example
Field access	`$$.users[0].name`
Array iteration	`$$.items[].price`
Python-style slices	`$$[0:5]`, `$$[-1]`, `$$[::2]`
Object projection	`$${name, email}`
Predicates	`$$.users[][?$$.active]`
FLWOR expressions	`for $x in $$ where $x.age > 21 return $x`
User-defined functions	`declare function local:double($x) { $x * 2 }`
Automatic join optimization	Hash-joins for FLWOR with multiple `for` clauses
JSON updates	`insert`, `delete`, `replace`, `rename`

Mutable JSON with Update Expressions

Brackit supports the full JSONiq Update Facility - modify JSON data with declarative expressions:

(: Insert fields into an object :)
insert json {"status": "active", "updated": current-dateTime()} into $user

(: Append to an array :)
append json $newItem into $order.items

(: Update a value :)
replace json value of $product.price with $product.price * 0.9

(: Remove a field :)
delete json $user.temporaryToken

(: Rename a field :)
rename json $record.oldFieldName as "newFieldName"

This makes Brackit ideal for data stores that need to expose update capabilities through a query language.

The Power of FLWOR

Unlike simple path-based query languages, Brackit supports full FLWOR expressions (for, let, where, order by, return) - the SQL of JSON:

(: Group sales by category and compute totals :)
for $sale in $$.sales[]
let $cat := $sale.category
group by $cat
order by sum($sale.amount) descending
return {
  "category": $cat,
  "total": sum($sale.amount),
  "count": count($sale)
}

(: Join orders with customers - automatically optimized! :)
for $order in $$.orders[], $customer in $$.customers[]
where $order.customer_id eq $customer.id
return {
  "order": $order.id,
  "customer": $customer.name,
  "total": $order.total
}

bjq: The jq Alternative

bjq provides a familiar jq-like interface with JSONiq power:

# Basic field access
bjq '$$.name' data.json

# Array operations
bjq '$$.users[].email' data.json
bjq '$$[0:5]' data.json              # First 5 elements
bjq '$$[-1]' data.json               # Last element

# Filtering
bjq 'for $u in $$.users[] where $u.active return $u' data.json

# Aggregation
bjq 'sum($$.prices[])' data.json

# Raw output (no quotes)
bjq -r '$$.name' data.json

# Compact output
bjq -c '$$' data.json

Embed in Your Data Store

Brackit is designed as a retargetable query compiler. Data stores can plug in their own:

Physical optimizations (index scans, specialized operators)
Storage backends (your custom Node/Item implementations)
Rewrite rules (index matching, predicate pushdown)

// Minimal example: run a query in Java
QueryContext ctx = new BrackitQueryContext();
Query query = new Query("for $i in 1 to 10 return $i * $i");
query.serialize(ctx, System.out);

The optimizer automatically applies:

Hash-joins for multi-variable FLWOR expressions
Predicate pushdown
Constant folding
And more...

Installation

Maven

<dependency>
  <groupId>io.sirix</groupId>
  <artifactId>brackit</artifactId>
  <version>0.7</version>
</dependency>

Gradle

dependencies {
    implementation 'io.sirix:brackit:0.7'
}

JSONiq Syntax

Arrays

[ 1, 2, 3 ]                          (: literal array :)
[ =(1 to 5) ]                        (: spread: [1, 2, 3, 4, 5] :)
$arr[0]                              (: index access (0-based!) :)
$arr[-1]                             (: last element :)
$arr[1:3]                            (: slice :)
$arr[]                               (: unbox to sequence :)

Objects

{ "name": "Alice", "age": 30 }       (: literal object :)
$obj.name                            (: field access :)
$obj{name, age}                      (: projection :)
{ $obj1, $obj2 }                     (: merge objects :)

Updates (for mutable stores)

insert json {"new": "field"} into $obj
delete json $obj.field
replace json value of $obj.name with "Bob"
rename json $obj.old as "new"

Differences from Standard JSONiq

Array indexes start at 0 (not 1)
Object projection: $obj{field1, field2} instead of jn:project()
Python-style array slices: $arr[start:end:step]
Statement syntax with semicolons (syntactic sugar for let-bindings)

Community

Join us on Discord to ask questions, share ideas, or contribute!

Used By

SirixDB - A bitemporal, append-only database storing JSON and XML with full version history at the node level

Origins & Publications

Brackit was created by Sebastian Bächle during his PhD at TU Kaiserslautern, researching query processing for semi-structured data. It's now maintained as part of the SirixDB project.

Separating Key Concerns in Query Processing - Ph.D thesis by Dr. Sebastian Bächle
Unleashing XQuery for Data-independent Programming
XQuery Processing over NoSQL Stores

License

New BSD License

Name		Name	Last commit message	Last commit date
Latest commit History 973 Commits
.github/workflows		.github/workflows
.mvn/wrapper		.mvn/wrapper
examples		examples
src		src
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.txt		LICENSE.txt
README.MD		README.MD
logs.json		logs.json
mission.md		mission.md
mvnw		mvnw
mvnw.cmd		mvnw.cmd
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Brackit

Why Brackit?

Performance

Scaling: 1M → 100M → 1B records — 9 query shapes × 3 engines

Quick Start

Option 1: Native Binary (fastest)

Option 2: Java Jar

Option 3: Build from Source

Features at a Glance

Mutable JSON with Update Expressions

The Power of FLWOR

bjq: The jq Alternative

Embed in Your Data Store

Installation

Maven

Gradle

JSONiq Syntax

Arrays

Objects

Updates (for mutable stores)

Differences from Standard JSONiq

Community

Used By

Origins & Publications

License

About

Uh oh!

Releases 5

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Brackit

Why Brackit?

Performance

Scaling: 1M → 100M → 1B records — 9 query shapes × 3 engines

Quick Start

Option 1: Native Binary (fastest)

Option 2: Java Jar

Option 3: Build from Source

Features at a Glance

Mutable JSON with Update Expressions

The Power of FLWOR

bjq: The jq Alternative

Embed in Your Data Store

Installation

Maven

Gradle

JSONiq Syntax

Arrays

Objects

Updates (for mutable stores)

Differences from Standard JSONiq

Community

Used By

Origins & Publications

License

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages