DBIx::Perlish manpage
- NAME
- VERSION
- SYNOPSIS
- DESCRIPTION
- Ideology
- Procedural interface
init()- db_fetch {}
- db_select {}
- db_update {}
- db_delete {}
db_insert()union()intersect()quirk()except()- $SQL and @BIND_VALUES
- Special treatment of the
$dbhvariable - Query sub syntax
- Table variables declarations
- Query filter statements
- Return statements
- Assignments
- Result limiting and ordering statements
- Conditional statements
- Statements with label syntax
- Compound queries' statements
- Subqueries
- Joins
- Object-oriented interface
- Working with multiple database handles
- Database driver specifics
- Implementation details and more ideology
- CONFIGURATION AND ENVIRONMENT
- DEPENDENCIES
- INCOMPATIBILITIES
- BUGS AND LIMITATIONS
- AUTHOR
- ACKNOWLEDGEMENTS
- SUPPORT
- LICENSE AND COPYRIGHT
NAME
DBIx::Perlish - a perlish interface to SQL databases
VERSION
This document describes DBIx::Perlish version 0.42
SYNOPSIS
use DBI;
use DBIx::Perlish;
my $dbh = DBI->connect(...);
DBIx::Perlish::init($dbh);
# selects:
my @rows = db_fetch {
my $x : users;
defined $x->id;
$x->name !~ /\@/;
};
# sub-queries:
my @rows = db_fetch {
my $x : users;
$x->id <- db_fetch {
my $t2 : table1;
$t2->col == 2 || $t2->col == 3;
return $t2->user_id;
};
$x->name !~ /\@/;
};
# updates:
db_update {
data->num < 100;
data->mutable;
data->num = data->num + 1;
data->name = "xyz";
};
# more updates:
db_update {
my $d : data;
$d->num < 100, $d->mutable;
$d = {
num => $d->num + 1,
name => "xyz"
};
};
# deletes:
db_delete {
my $t : table1;
!defined $t->age or
$t->age < 18;
};
# inserts:
my $id = 42;
db_insert 'users', {
id => $id,
name => "moi",
};
DESCRIPTION
The DBIx::Perlish module provides the ability to work with databases
supported by the DBI module using Perl's own syntax for four most
common operations: SELECT, UPDATE, DELETE, and INSERT.
By using DBIx::Perlish, you can write most of your database
queries using a domain-specific language with Perl syntax.
Since a Perl programmer knows Perl by definition,
and might not know SQL to the same degree, this approach
generally leads to a more comprehensible and maintainable
code.
The module is not intended to replace 100% of SQL used in your program. There is a hope, however, that it can be used to replace a substantial portion of it.
The DBIx::Perlish module quite intentionally neither implements
nor cares about database administration tasks like schema design
and management. The plain DBI interface is quite sufficient for
that. Similarly, and for the same reason, it does not take care of
establishing database connections or handling transactions. All this
is outside the scope of this module.
Ideology
There are three sensible and semi-sensible ways of arranging code that works with SQL databases in Perl:
- SQL sprinkling approach
-
One puts queries wherever one needs to do something with the database, so bits and pieces of SQL are intermixed with the program logic. This approach can easily become an incomprehensible mess that is difficult to read and maintain.
- Clean and tidy approach
-
Everything database-related is put into a separate module, or into a collection of modules. Wherever database access is required, a corresponding sub or method from such a module is called from the main program. Whenever something is needed that the DB module does not already provide, a new sub or method is added into it.
- Object-relational mapping
-
One carefully designs the database schema and an associated collection of classes, then formulates the design in terms of any of the existing object-relational mapper modules like
Class::DBI,DBIx::ClassorTangram, then uses objects which perform all necessary queries under the hood. This approach is even cleaner than ``clean and tidy'' above, but it has other issues. Some schemas do not map well into the OO space. Typically, the resulting performance is an issue as well. The performance issues can in some cases be alleviated by adding hand-crafted SQL in strategic places, so in this regard the object-relational mapping approach can resemble the ``clean and tidy'' approach.
The DBIx::Perlish module is meant to eliminate the majority
of the ``SQL sprinkling'' style of database interaction.
It is also fully compatible with the ``clean and tidy'' method.
Procedural interface
init()
The init() sub initializes procedural interface
to the module.
It accepts named parameters.
One parameter, dbh, is required and must be a valid DBI database handler.
All other parameters are silently ignored.
Alternatively, init() can be called with a single
positional parameter, in which case it is assumed to
be the DBI database handler.
If the supplied database handler is not valid, an exception is thrown.
This procedure does not return anything meaningful.
Examples:
my $dbh = DBH->connect(...);
DBIx::Perlish::init(dbh => $dbh);
my $dbh = DBH->connect(...);
DBIx::Perlish::init($dbh);
db_fetch {}
The db_fetch {} function queries and returns data from
the database.
The function parses the supplied query sub, converts it into the corresponding SQL SELECT statement, and executes it.
What it returns depends on two things: the context and the return statement in the query sub, if any.
If there is a return statement which specifies exactly one
column, and db_fetch {} is called in the scalar context,
a single scalar representing the requested column is returned
for the first row of selected data. Example:
my $somename = db_fetch { return user->name };
Borrowing DBI's terminology, this is analogous to
my $somename =
$dbh->selectrow_array("select name from user");
If there is a return statement which specifies exactly one
column, and db_fetch {} is called in the list context,
an array containing the specified column for all selected
rows is returned. Example:
my @allnames = db_fetch { return user->name };
This is analogous to
my @allnames =
@{$dbh->selectcol_arrayref("select name from user")};
When there is no return statement, or if the return statement specifies multiple columns, then an individual row is represented by a hash reference with column names as the keys.
In the scalar context, a single hashref is returned, which corresponds to the first row of selected data. Example:
my $h = db_fetch { my $u : user };
print "name: $h->{name}, id: $h->{id}\n";
In DBI parlance that would look like
my $h = $dbh->selectrow_hashref("select * from user");
print "name: $h->{name}, id: $h->{id}\n";
In the list context, an array of hashrefs is returned, one element for one row of selected data:
my @users = db_fetch { my $u : user };
print "name: $_->{name}, id: $_->{id}\n" for @users;
Again, borrowing from DBI, this is analogous to
my @users = @{$dbh->selectall_arrayref("select * from user",
{Slice=>{}})};
print "name: $_->{name}, id: $_->{id}\n" for @users;
There is also a way to specify that one or several of
the return values are the key fields, to obtain a behavior
similar to that of the DBI's selectall_hashref() function.
A return value is a key field if it is prepended with -k:
my %data = db_fetch {
my $u : users;
return -k $u->name, $u;
};
This is analogous to
my %data = %{$dbh->selectall_hashref(
"select name, * from users", "name")};
If the db_fetch {} containing key fields is called in the
scalar context, it returns a hash reference instead of a hash.
In both cases the complete result set is returned.
This is different from calling the db_fetch {} without key fields
in the scalar context, which always returns a single row (or a single
value), as explained above.
The db_fetch {} function will throw an exception if it is unable to
find a valid database handle to use, or if it is unable to convert its
query sub to SQL.
In addition, if the database handle is configured to throw exceptions, the function might throw any of the exceptions thrown by DBI.
Subqueries are permitted in db_fetch's query subs.
Please see Query sub syntax below for details of the syntax allowed in query subs.
The db_fetch {} function is exported by default.
db_select {}
The db_select {} function is an alias to the db_fetch {}.
It is exported by default.
db_update {}
The db_update {} function updates rows of a database table.
The function parses the supplied query sub, converts it into the corresponding SQL UPDATE statement, and executes it.
The function returns whatever DBI's do method returns.
The function will throw an exception if it is unable to find a valid database handle to use, or if it is unable to convert its query sub to SQL.
In addition, if the database handle is configured to throw exceptions, the function might throw any of the exceptions thrown by DBI.
A query sub of the db_update {} function must refer
to precisely one table (not counting tables referred to
by subqueries).
Neither return statements nor last statements are
allowed in the db_update {} function's query subs.
An attempt to call the db_update {} function with
no filtering expressions in the query sub will throw
an exception since such is very likely a dangerous mistake.
To allow such an update to proceed, include an exec
call with no parameters anywhere in the query sub.
Subqueries are permitted in db_update's query subs.
Please see Query sub syntax below for details of the syntax allowed in query subs.
Examples:
db_update {
tbl->id == 41;
tbl->id = tbl->id - 1;
tbl->name = "luff";
};
db_update {
tbl->id = 42;
exec; # without this an exception is thrown
};
db_update {
my $t : tbl;
$t->id == 40;
$t = {
id => $t->id + 2,
name => "LIFF",
};
};
db_update {
tbl->id == 40;
tbl() = {
id => tbl->id + 2,
name => "LIFF",
};
};
The db_update {} function is exported by default.
db_delete {}
The db_delete {} function deletes data from
the database.
The db_delete {} function parses the supplied query sub,
converts it into the corresponding SQL DELETE statement,
and executes it.
The function returns whatever DBI's do method returns.
The function will throw an exception if it is unable to find a valid database handle to use, or if it is unable to convert its query sub to SQL.
In addition, if the database handle is configured to throw exceptions, the function might throw any of the exceptions thrown by DBI.
A query sub of the db_delete {} function must refer
to precisely one table (not counting tables referred to
by subqueries).
Neither return statements nor last statements are
allowed in the db_delete {} function's query subs.
An attempt to call the db_delete {} function with
no filtering expressions in the query sub will throw
an exception since such is very likely a dangerous mistake.
To allow such a delete to proceed, include an exec
call with no parameters anywhere in the query sub.
Subqueries are permitted in db_delete's query subs.
Please see Query sub syntax below for details of the syntax allowed in query subs.
Examples:
db_delete { $x : users; exec; } # delete all users
# delete with a subquery
db_delete {
my $u : users;
$u->name <- db_fetch {
visitors->origin eq "Uranus";
return visitors->name;
}
}
The db_delete {} function is exported by default.
db_insert()
The db_insert() function inserts rows into a
database table.
This function is different from the rest because it does not take a query sub as the parameter.
Instead, it takes a table name as its first parameter, and any number of hash references afterwards.
For each specified hashref, a new row is inserted into the specified table. The resulting insert statement specifies hashref keys as the column names, with corresponding values taken from hashref values. Example:
db_insert 'users', { id => 1, name => "the.user" };
A value can be a call to the exported sql() function,
in which case it is inserted verbatim into the generated
SQL, for example:
db_insert 'users', {
id => sql("some_seq.nextval"),
name => "the.user"
};
The function returns the number of insert operations performed. If any of the DBI insert operations fail, the function returns undef, and does not perform remaining inserts.
The function will throw an exception if it is unable to find a valid database handle to use.
In addition, if the database handle is configured to throw exceptions, the function might throw any of the exceptions thrown by DBI.
The db_insert {} function is exported by default.
union()
This is a helper sub which is meant to be used inside
query subs. Please see Compound queries' statements
for details. The union() can be exported via :all
import declaration.
intersect()
This is a helper sub which is meant to be used inside
query subs. Please see Compound queries' statements
for details. The intersect() can be exported via :all
import declaration.
quirk()
Unfortunately it is not always possible to generate an
SQL statement which is valid for different DBI drivers,
even when the DBIx::Perlish module has the knowledge
about what driver is in use.
The quirk() sub exists to alleviate this problem in
certain situations by registering ``quirks''.
Please avoid using it if possible.
It accepts at least two positional parameters. The first parameter is the DBI driver flavor. The second parameter identifies a particular quirk. The rest of parameters are quirk-dependent.
It is a fatal error to attempt to register a quirk that is not recognized by the module.
Currently only Oracle has any quirks, which are listed below:
- table_func_cast
-
When table functions are used in Oracle, one sometimes gets an error ``ORA-22905: cannot access rows from a non-nested table item''. The solution recommended by Oracle is to do an explicit type cast to a correct type. Since the
DBIx::Perlishmodule has no way of knowing what the correct type is, it needs a little help. Thetable_func_castquirk requires two extra parameters, the name of a table function and the type to cast it to.
except()
This is a helper sub which is meant to be used inside
query subs. Please see Compound queries' statements
for details. The except() can be exported via :all
import declaration.
$SQL and @BIND_VALUES
The DBIx::Perlish module provides two global variables
(not exported) to aid in debugging.
The $DBIx::Perlish::SQL variable contains the text of
the SQL which was most recently generated by the procedures above
(except db_insert()).
The @DBIx::Perlish::BIND_VALUES array contains the bind values
to be used with the corresponding SQL code.
Special treatment of the $dbh variable
If the procedural interface is used, and the user did not
call init() before issuing any of the db_fetch {},
db_update {}, db_delete {} or db_insert {}, those
functions look for one special case before bailing out.
Namely, they try to locate a variable my $dbh, our $dbh,
or caller's package global $dbh,
in that order, in the scope in which they are used. If such
variable is found, and if it contains a valid DBI database
handler, they will use it for performing the actual query.
This allows one to write something like that, and expect the
module to do the right thing:
my $dbh = DBI->connect(...);
my @r = db_fetch { users->name !~ /\@/ };
The author cannot recommend relying on this feature in the
production code; if in doubt, call init() first
and you won't be unpleasantly surprized.
In order for this feature to be operational, the PadWalker
module must be installed.
Query sub syntax
The important thing to remember is that although the query subs have Perl syntax, they do not represent Perl, but a specialized ``domain specific'' database query language with Perl syntax.
A query sub can consist of the following types of statements:
-
table variables declarations;
-
query filter statements;
-
return statements;
-
assignments;
-
result limiting and ordering statements;
-
conditional statements;
-
statements with label syntax;
-
compound queries' statements.
The order of the statements is generally not important, except that table variables have to be declared before use.
Table variables declarations
Table variables declarations allow one to associate lexical variables with database tables. They look like this:
my $var : tablename;
It is possible to associate several variables with the same table; this is the preferable mechanism if self-joins are desired.
In case the table name is not known until runtime, it is also possible to write for example
my $var : table = $data->{tablename};
In this case the attribute ``table'' must be specified verbatim, and the name of the table is taken from the right-hand side of the assignment.
Last, but not least, a combination of verbatim ``table'' attribute with a nested db_fetch {} can be used to implement inline views:
my $var : table = db_fetch { ... };
In this case a select statement corresponding to
the nested db_fetch {} will represent the table.
Please note that not all database drivers support
this, although at present the DBIx::Perlish module
does not care and will generate SQL which will subsequently
fail to execute.
Another possibility for declaring table variables is described in Statements with label syntax.
Please note that db_update {} and db_delete {} must only refer to a single table.
Query filter statements
Query filter statements have a general form of Perl expressions.
Binary comparison operators, logical ``or'' (both high and lower
precedence form), matching operators =~ and !~, binary arithmetic
operators, string concatenation, defined(expr),
and unary ! are all valid in the filters.
There is also a special back-arrow, ``comes from'' <- binary
operator used for matching a column to a set of values, and for
subqueries.
Individual terms can refer to a table column using dereferencing
syntax
(one of tablename->column,
$tablevar->column,
tablename->$varcolumn, or
$tablevar->$varcolumn),
to an integer, floating point, or string constant, to a function
call, to next statement with an argument,
or to a scalar value in the outer scope (simple scalars,
hash elements, or dereferenced hashref elements chained to
an arbitrary depth are supported).
Inside constant strings, table column specifiers are interpolated; the result of such interpolation is represented as a sequence of explicit SQL concatenation operations. The variable interpolation syntax is somewhat different from normal Perl rules, which does not interpolate method calls. So it is perfectly legal to write
return "abc $t->name xyz";
When it is impossible to distinguish between the column name and the following characters, the hash element syntax must be used instead:
return "abc$t->{name}xyz";
Of course, one may want to avoid the trouble altogether and use explicit Perl concatenation in such cases:
return "abc" . $t->name . "xyz";
Please note that specifying column names as hash elements is only valid inside interpolated strings; this may change in the future versions of the module.
Please also note that column specifiers of
tablename->column form cannot be embedded into strings;
again, use explicit Perl concatenation in such cases.
Function calls can take an arbitrary number of arguments. Each argument to a function must currently be a term, although it is expected that more general expressions will be supported in the future. The function call appear verbatim in the resulting SQL, with the arguments translated from Perl syntax to SQL syntax. For example:
lower($t1->name) eq lower($t2->lastname);
The lc and uc builtin functions are translated to
lower and upper, respectively.
A special case is when sql() function (with a single
parameter) is called. In this case the parameter of the
function call inserted verbatim into the generated SQL,
for example:
db_update {
tab->state eq "new";
tab->id = sql "some_seq.nextval";
};
There is also a shortcut when one can use backquotes for verbatim SQL pieces:
db_update {
tab->state eq "new";
tab->id = `some_seq.nextval`;
};
A next statement with a (label) argument is interpreted as
an operator of getting the next value out of a sequence,
where the label name is the name of the sequence.
Syntax specific to the DBI driver will be used to represent
this operation. It is a fatal error to use such a statement
with DBI drivers which do not support sequences. For example,
the following is exactly equivalent to the example above,
except it is more portable:
db_update {
tab->state eq "new";
tab->id = next some_seq;
};
The ``comes from'' <- binary operator can be used in the
following manner:
my @ary = (1,2,3);
db_fetch {
tab->id <- @ary;
};
This is equivalent to SQL's IN list operator, where
the list comes from the @ary array. An array reference
or an anonymous array can also be used in place of the @ary
here.
The <- operator can also be used with Subqueries,
below.
Return statements
Return statements determine which columns are returned by a query under what names. Each element in the return statement can be either a reference to the whole table, an expression involving table columns, or a string constant, in which case it is taken as an alias to the next element in the return statement:
return ($table->col1, anothername => $table->col2);
If an element is a reference to the whole table, it is understood that all columns from this table are returned:
return ($t1->col1, $t1->col2, $t2);
Table references cannot be aliased by a name.
One can also specify a ``distinct'' or ``DISTINCT'' string constant in the beginning of the return list, in which case duplicated rows will be eliminated from the result set.
It is also permissible to use a next operator with a label
argument (see above) in return statements:
return next some_seq;
Return statements are only valid in db_fetch {}.
Query subs representing subqueries using the reverse arrow notation must have exactly one return statement returning exactly one column (see Subqueries below).
Assignments
Assignments can take two form: individual column assignments or bulk assignments. The former must have a reference to a table column on the left-hand side, and an expression like those accepted in filter statements on the right-hand side:
table1->id = 42;
$t->column = $t->column + 1;
The bulk assignments must have a table specifier on the left-hand side, and a hash reference on the right-hand side. The keys of the hash represent column names, and the values are expressions like those in the individual column assignments:
$t = {
id => 42,
column => $t->column + 1
};
or
tablename() = {
id => 42,
column => tablename->column + 1
};
Please note a certain ugliness in tablename() in the last example,
so it is probably better to either use table vars, or stick to the
single assignment syntax of the first example.
It is possible to intermix hashes and hashrefs dereferencings with verbatim key/value pairs in bulk assignments:
$t = {
id => 42,
column => $t->column + 1,
%$hashref_from_outer_scope
};
Please note that the right hand side of the bulk assignment must be an anonymouse hash reference. Thus, the following is invalid:
$t = $hashref_from_outer_scope;
Instead, write
$t = {%$hashref_from_outer_scope};
The latter emphasizes the fact that this is the bulk assignment, which is not clear from the former statement.
Assignment statements are only valid in db_update {}.
Result limiting and ordering statements
The last command can be used to limit the number of
results returned by a fetch operation.
If it stands on its own anywhere in the query sub, it means ``stop
after finding the first row that matches other filters'', so it
is analogous to LIMIT 1 in many SQL dialects.
It can also be used in conjunction with a range .. operator,
so that
last unless 5..20;
is equivalent to
OFFSET 5 LIMIT 16
The sort builtin can be used to specify the desired order
of the results:
sort $t->col1, $t->col2;
is equivalent to
ORDER BY col1, col2
In order to support the ordering direction, the sort expressions can be preceded by a literal string which must satisfy the pattern /^(asc)/i (for ascending order, which is the default), or /^(desc)/i for descending order:
sort desc => $t->col1, asc => $t->col2;
is equivalent to
ORDER BY col1 DESC, col2
Result limiting and ordering statements are only valid in db_fetch {}.
Conditional statements
There is a limited support for parse-time conditional expressions.
At the query sub parsing stage, if the conditional does not mention any tables or columns, and refers exclusively to the values from the outer scope, it is evaluated, and the corresponding filter (or any other kind of statement) is only put into the generated SQL if the condition is true.
For example,
my $type = "ICBM";
db_fetch {
my $p : products;
$p->type eq $type if $type;
};
will generate the equivalent to select * from products where type = 'ICBM',
while the same code would generate just select * from products if $type
were false.
Similarly,
my $want_z = 1;
db_fetch {
my $p : products;
return $p->x, $p->y unless $want_z;
return $p->x, $p->y, $p->z if $want_z;
};
will generate the equivalent of select x, y from products when
$want_z is false, and select x, y, z from products when
$want_z is true.
Statements with label syntax
There is a number of special labels which query sub syntax allows.
Specifying label distinct: anywhere in the query sub leads to duplicated
rows being eliminated from the result set.
Specifying label limit: followed by a number (or a scalar variable
representing a number) limits the number of rows returned by the query.
Specifying label offset: followed by a number N (or a scalar variable
representing a number N) skips first N rows from the returned result
set.
Specifying label order:, orderby:, order_by:,
sort:, sortby:, or sort_by:, followed by a list of
expressions will sort the result set according to the expressions.
For details about the sorting criteria see the documentation
for ORDER BY clause in your SQL dialect reference manual.
Before a sorting expression in a list one may specify one of the
string constants ``asc'', ``ascending'', ``desc'', ``descending'' to
alter the sorting order, for example:
db_fetch {
my $t : tbl;
order_by: asc => $t->name, desc => $t->age;
};
Specifying label group:, groupby:, or group_by:,
followed by a list of column specifiers is equivalent to
the SQL clause GROUP BY col1, col2, ....
The module implements an experimental feature which
in some cases allows one to omit the explicit
group_by: label. If there is an explicit return statement
which mentions an aggregate function alongside ``normal''
column specifiers, and that return statement does not
reference the whole table, and the explicit group_by: label
is not present in the query, the
DBIx::Perlish module will generate one automatically.
For example, the following query:
db_fetch {
my $t : tab;
return $t->name, $t->type, count($t->age);
};
will execute the equivalent of the following SQL statement:
select name, type, count(age) from tab group by name, type
The avg(), count(), max(), min(), and sub()
functions are considered to be aggregate.
Specifying label table: followed by a lexical variable
declaration, followed by an assignment introduces an alternative
table declaration syntax. The value of the expression on the right
hand side of the assignment is taken to be the name of the table:
my $data = { table => "mytable" };
db_fetch {
table: my $t = $data->{table};
};
This is useful if you don't know the names of your table until runtime.
All special labels are case insensitive.
Special labels are only valid in db_fetch {}.
Compound queries' statements
The SQL compound queries UNION, INTERSECT, and EXCEPT are supported using the following syntax:
db_fetch {
{
... normal query statements ...
}
compound-query-keyword
{
... normal query statements ...
}
};
Here compound-query-keyword is one of union,
intersect, or except.
This feature will only work if the use statement for
the DBIx::Perlish module was written with :all
export declaration, since union, intersect, and except
are subs that are not exported by default by the module.
It is the responsibility of the programmer to make sure that results of the individual queries used in a compound query are compatible with each other.
Subqueries
It is possible to use subqueries in db_fetch {}, db_update {}, and db_delete {}.
There are two variants of subqueries. The first one is a
call, as a complete statement,
to db_fetch {} anywhere in the body of the query sub.
This variant corresponds to the EXISTS (SELECT ...) SQL
construct, for example:
db_delete {
my $t : table1;
db_fetch {
$t->id == table2->table1_id;
};
};
Another variant corresponds to the column IN (SELECT ...) SQL
construct. It uses a special syntax with back-arrow <-
(read it as ``comes from''),
which signifies that the column specifier on the left gets
its values from whatever is returned by a db_fetch {} on
the right:
db_delete {
my $t : table1;
$t->id <- db_fetch {
return table2->table1_id;
};
};
This variant puts a limitation on the return statement in the sub-query query sub. Namely, it must contain a return statement with exactly one return value.
If the right-hand side of the ``comes from'' operator is a function call, the function is assumed to be a function potentially returning a set of values, or a ``table function'', in Oracle terminology. Such construct is converted into a driver-dependent subselect involving the table function:
db_fetch {
tbl->id <- tablefunc($id);
};
Joins
Joins are implemented similar to subqueries, using embedded db_fetch call to
specify a join condition. The join syntax is one of (the last two are
equivalent):
join $t1 BINARY_OP $t2;
join $t1 BINARY_OP $t2 => db_fetch { CONDITION };
join $t1 BINARY_OP $t2 <= db_fetch { CONDITION };
where CONDITION is an arbitrary expression using fields from $t1 and $t2
, and BINARY_OP is one of *,+,x,&,|,<,> operators,
which correspond to the following standard join types:
- Inner join
-
This corresponds to either of
*,&, andxoperators. Thedb_fetch {}condition for inner join may be omitted, in which case it degenerates into a cross join. - Full outer join
-
It is specified with
+or|. TheDBIx::Perlishmodule does not care that some database engines do not support full outer join, nor does it try to work around this limitation. - Left outer join
-
< - Right outer join
-
>
Example:
my $x : x;
my $y : y;
join $y * $x => db_fetch { $y-> id == $x-> id };
Object-oriented interface
new()
Constructs and returns a new DBIx::Perlish object.
Takes named parameter.
One parameter, dbh, is required and
must be a valid DBI database handler.
Another parameter which the new() understands is quirks,
which, if present, must be a reference to an array of anonymous
arrays, each corresponding to a single call to quirk().
Please see quirk() for details.
Can throw an exception if the supplied parameters are incorrect.
fetch()
An object-oriented version of db_fetch {}.
update()
An object-oriented version of db_update {}.
delete()
An object-oriented version of db_delete {}.
insert()
An object-oriented version of db_insert().
sql()
Takes no parameters. Returns the SQL string, most recently generated by database queries performed by the object. Returns undef if there were no queries made thus far.
Example:
$db->query(sub { $u : users });
print $db->sql, "\n";
The sql() sub can also be called in a procedural fashion,
in which case it serves the purpose of injecting
verbatim pieces of SQL into query subs
(see Query filter statements) or into the values
to be inserted via db_insert().
The sql() function is exported by default.
bind_values()
Takes no parameters. Returns an array of bind values that were used in the most recent database query performed by the object. Returns an empty array if there were not queries made thus far.
Example:
$db->query(sub { users->name eq "john" });
print join(", ", $db->bind_values), "\n";
quirk()
An object-oriented version of quirk().
Working with multiple database handles
There are several ways in which the DBIx::Perlish module can be used
with several different database handles within the same program:
- Using object-oriented interface
-
The advantage of this approach is that there is no confusion about which database handle is in use, since a DBIx::Perlish object is always created with an explicit database handle as a parameter to new().
-
The obvious disadvantage is that one has to explicitly use ``sub'' when specifying a query sub, so the syntax is unwieldy.
- Switching handles with init()
-
This will work, but it will add quite a bit of clutter to the code, especially if queries to multiple databases are intermixed with each other.
- Switching handles via manipulation of the
$dbhvariable -
This has the same disadvantage as the previous method. Besides, it looks like vodoo.
- Using special import syntax
-
It is possible to import differently named specialized versions of the subs normally exported by the
DBIx::Perlishmodule, which will use specified database handle. The syntax is as follows: -
use DBIx::Perlish; my $dbh = DBI->connect(...); -
my $foo_dbh = DBI->connect(...); use DBIx::Perlish prefix => "foo", dbh => \$foo_dbh; -
my $bar_dbh = DBI->connect(...); use DBIx::Perlish prefix => "bar", dbh => \$bar_dbh; -
my @default = db_fetch { ... }; my @foo = foo_fetch { ... }; my @bar = bar_fetch { ... }; -
The syntax and semantics of such specialized versions is exactly the same as with the normal db_fetch {}, db_select {}, db_update {}, db_delete {}, and db_insert(), except that they use the database handle specified in the
usestatement for all operations. As can be seen from the example above, the normal versions still work as intended, employing the usual mechanisms for determining which handle to use.
Database driver specifics
The generated SQL output can differ depending on the particular database driver in use.
MySQL
Native MySQL regular expressions are used if possible and if
a simple LIKE won't suffice.
Oracle
The function call sysdate() is transformed into sysdate
(without parentheses).
Selects without table specification are assumed to be selects from DUAL, for example:
my $newval = db_fetch { return `tab_id_seq.nextval` };
Table functions in Oracle are handled specially.
There are quirks (see quirk()) that can be registered for Oracle driver.
Postgresql
Native Postgresql regular expressions are used if possible and if
a simple LIKE won't suffice.
SQLite
Native Perl regular expressions are used with SQLite even for
simple match cases, since SQLite does not know how to optimize
LIKE applied to an indexed column with a constant prefix.
Implementation details and more ideology
To achieve its purpose, this module uses neither operator overloading nor source filters.
The operator overloading would only work if individual tables were
represented by Perl objects. This means that an object-relational
mapper like Tangram can do it, but DBIx::Perlish cannot.
The source filters are limited in other ways: the modules using them are often incompatible with other modules that also use source filtering, and it is very difficult to do source filtering when any degree of flexibility is required. Only perl can parse Perl!
The DBIx::Perlish module, on the other hand, leverages perl's ability
to parse Perl and operates directly on the already compiled Perl code.
In other words, it parses the Perl op tree (syntax tree).
The idea of this module came from Erlang. Erlang has a so called list comprehension syntax, which allows one to generate lists using generator expressions and to select the list elements using filter expressions. Furthermore, the authors of the Erlang database, Mnesia, hijacked this syntax for the purpose of doing database queries via a mechanism called parse transform. The end result was that the database queries in Erlang are expressed by using Erlang's own syntax.
I found this approach elegant, and thought ``why something like this cannot be done in Perl''?
CONFIGURATION AND ENVIRONMENT
DBIx::Perlish requires no configuration files or environment variables.
Running under the Devel::Cover manpage
When the DBIx::Perlish module detects that the current program
is being run under the Devel::Cover manpage,
it tries to cheat a little bit and feeds the Devel::Cover manpage
with false information to make those
query subs which were parsed by the module
to appear ``covered''.
This is done because the query subs are never executed, and thus would normally be presented as ``not covered'' by the the Devel::Cover manpage reporter. Although a developer has no trouble deciding to ignore such ``red islands'', he has to perform this decision every time he looks at the coverage data, which tends to become annoying rather quickly.
Currently, only statement and sub execution data are faked.
DEPENDENCIES
The DBIx::Perlish module needs at least perl 5.8.2, quite possibly
a somewhat higher version. I have only tested it on
5.8.8 and 5.8.4.
This module requires DBI to do anything useful.
In order to support the special handling of the $dbh variable,
PadWalker needs to be installed.
Other modules used used by DBIx::Perlish are included
into the standard Perl distribution.
INCOMPATIBILITIES
If you use DBIx::Perlish together with the HTML::Mason manpage,
you are likely to see warnings ``Useless use of ... in void context''
that Mason helpfully converts into fatal errors.
To fix this, edit your handler.pl and add the following line:
$ah->interp->ignore_warnings_expr("(?i-xsm:Subroutine .* redefined|Useless use of .+ in void context)");
Here $ah must refer to an instance of HTML::Mason::ApacheHandler
class.
Mason is to blame for this, since it disregards warnings' handlers installed by other modules.
BUGS AND LIMITATIONS
No bugs have been reported.
Please report any bugs or feature requests to
bug-dbix-perlish@rt.cpan.org, or through the web interface at
http://rt.cpan.org.
A number of features found in many SQL dialects is not supported.
The module cannot handle more than 100 tables in a single query sub.
Although variables closed over the query sub can be used in it, only simple scalars, hash elements, and dereferenced hasref elements are understood at the moment.
If you would like to see something implemented, or find a nice Perlish syntax for some SQL feature, please let me know!
AUTHOR
Anton Berezin <tobez@tobez.org>
ACKNOWLEDGEMENTS
Special thanks to Dmitry Karasik, who contributed code and syntax ideas on several occasions, and with whom I spent considerable time discussing this module.
I would also like to thank Henrik Andersen, Lars Thegler, and Phil Regnauld for discussions, suggestions and code contributions.
This work is in part sponsored by Telia Denmark.
SUPPORT
There is also the project website at http://dbix-perlish.tobez.org/
LICENSE AND COPYRIGHT
Copyright (c) 2007, Anton Berezin <tobez@tobez.org>. All rights reserved.
Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:
1. Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.
2. Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.
THIS SOFTWARE IS PROVIDED BY AUTHOR AND CONTRIBUTORS ``AS IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL AUTHOR OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
