Transpilation

How it works

Consider the following call to transpile:

from giql import transpile

sql = transpile(
    "SELECT * FROM variants WHERE interval INTERSECTS 'chr1:1000-2000'",
    tables=["variants"],
)

print(sql)

The transpiler performs three main steps:

Parses the GIQL query into an abstract syntax tree (AST) to identify GIQL-specific operators
Transforms genomic operators into SQL predicates and Common Table Expressions (CTEs), and replaces genomic pseudo-columns with actual column references
Generates SQL output from the modified AST

The result is a standard SQL query that can be consumed by an execution engine that is not genome-aware.

SELECT * FROM variants
WHERE "chrom" = 'chr1' AND "start" < 2000 AND "end" > 1000

Notably, the transpiler expands logical genomic range columns into physical column comparisons.
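The two inequalities in the generated WHERE clause are the usual interval overlap test. The sketch below mirrors that predicate in plain Python to show which rows it would keep. It is an illustration only, not part of GIQL: the overlaps function and the sample rows are hypothetical, and the chromosome check is left out for brevity.

```python
def overlaps(row_start: int, row_end: int, query_start: int, query_end: int) -> bool:
    """Mirror of the generated predicate: "start" < query_end AND "end" > query_start."""
    return row_start < query_end and row_end > query_start


# Hypothetical (start, end) rows, all assumed to be on chr1.
rows = [(500, 1000), (900, 1100), (1500, 1600), (1999, 2500), (2000, 3000)]

# Keep the rows that satisfy the predicate for the query range 1000-2000.
hits = [row for row in rows if overlaps(row[0], row[1], 1000, 2000)]
print(hits)  # [(900, 1100), (1500, 1600), (1999, 2500)]
```

Because both comparisons are strict, rows that only touch the query range at a boundary, such as (500, 1000) and (2000, 3000), are not returned.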

The Table configuration of "variants" tells GIQL which physical columns correspond to the logical interval column. The example above uses the default column names: chrom, start, end.
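To make the logical-to-physical mapping concrete, here is a minimal sketch of how a region literal could be expanded into a predicate over configurable column names. It does not use or reproduce GIQL's internals: IntervalColumns, intersects_predicate, and the alternative column names are all hypothetical and exist only for this illustration.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class IntervalColumns:
    """Hypothetical mapping from the logical interval to physical columns."""
    chrom: str = "chrom"
    start: str = "start"
    end: str = "end"


# Matches region literals of the form 'chr1:1000-2000'.
_REGION = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)-(?P<end>\d+)$")


def _quote(name: str) -> str:
    """Wrap a column name in double quotes, as in the generated SQL."""
    return '"' + name + '"'


def intersects_predicate(region: str, cols: IntervalColumns = IntervalColumns()) -> str:
    """Expand a region literal into physical column comparisons."""
    match = _REGION.match(region)
    if match is None:
        raise ValueError(f"not a region literal: {region!r}")
    chrom = match["chrom"]
    start = int(match["start"])
    end = int(match["end"])
    return (
        f"{_quote(cols.chrom)} = '{chrom}' "
        f"AND {_quote(cols.start)} < {end} "
        f"AND {_quote(cols.end)} > {start}"
    )


# Default column names reproduce the WHERE clause shown earlier.
print(intersects_predicate("chr1:1000-2000"))

# A table with different physical column names (made up for this example).
custom = IntervalColumns(chrom="contig", start="pos_start", end="pos_end")
print(intersects_predicate("chr1:1000-2000", custom))
```

With the defaults, the first call prints the same predicate as the transpiled query above; the second shows that only the column names change when the mapping does.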
