By Peter Kreußel and Nils Magnus
Most state-of-the-art websites include some form of server-side database, but developers building HTML pages onto CD-ROMs have fewer options. Other situations also might favor keeping the data and program logic on the client.
Enter PublicSQL [1], a JavaScript program that acts as a simple SQL database for access from JavaScript apps. The JavaScript security model doesn't let the script store new data because the browser is not allowed to use JavaScript to write to disk, which means that PublicSQL is better suited for read-only settings.
Although HTML 5 has its own brand of client-side SQL database within the browser that is similar to, and actually better than, PublicSQL, it is not yet supported by most browsers, so PublicSQL should be useful for at least three more years [2].
The PublicSQL database can handle predefined data with familiar SQL commands (see the "Installation and Use" box). PublicSQL supports SELECT
statements with WHERE
and ORDER BY
clauses (Table 1). In addition to string and numeric fields, tables can contain date columns. Multiple tables can be linked with inner joins, and the instruction set includes a DISTINCT
instruction. The feature scope is not huge, but many tasks difficult to handle with pure JavaScript shrink to a couple of lines of code if you use PublicSQL. A sample application that searches the fields of a database for fruits and vegetables (Figure 1) demonstrates this. Figure 2 shows the schematic layout. The whole application is available for downloading on the Linux Magazine website [3], so you can try it out locally.
Field Selection
drop-down box and the input box for the search box appear on the page.PublicSQL needs a JavaScript-accessible file for each table. The file can either reside on the local filesystem - as with a data CD - or be loaded retrospectively with HTTP. Each file contains an array you can create easily from CSV files in production use; you just need to add two lines at the beginning and end [4].
Installation and Use |
The installation of PublicSQL, whose author, Jörg Siebrands, placed under the MIT License is quite simple. Just download a single file, <script type="text/javascript" src="publicsql.js"></script> and store the JavaScript file that you downloaded in the same directory. Alternatively, you can specify http://www.publicsql.org/publicsql.js as the source. This means that the application will always use the latest version; however, you risk the path changing sometime in the future or not having a network connection when you need it. |
Listing 1 is a simple example of HTML with PublicSQL. If no server exists to respond with a new page to values sent by POST
or GET
, a permanently open page can display the results - working as a search form at the same time. Lines 7 through 11 define a form with just two buttons: the + Search criterion button (line 8) uses the add_searchfield()
event handler linked to it to add a line with a search condition (Figure 1). Clicking the Execute search button (line 10) starts the process. Below the form in the code is an empty div
tag (line 13) into which the execute_search()
function drops the search results.
To start a search, click on + Search criterion. To insert a list field, select the field you want to search (Figure 1, left). The add_searchfield()
event handler (Listing 2) handles the insertion. code
populates the first line of the function with the opening tag of a drop-down list field: <select name="searchindex" onclick="add_input(Index, this)">
. The onclick
event handler here is explained in the next section. A loop (lines 10-14) adds an option
tag for each element in the fieldnames
array
<option value="0">[please select]</option> <option value="1">Fruit or vegetable</option>
and so on. Line 15 closes the select
tag and adds an empty span
tag with the ID input_search fieldcount
, making it easier for add_input()
to insert the input field in the second column later. The fieldcount
line counter helps keep the divs apart.
Listing 2: Dynamic List Field |
01 var fieldcount=0; 02 var fieldnames=[ "[Please select]", 03 "Fruit or vegetable", "Color", 04 "Taste", "Size" ]; 05 06 function add_searchfield() { 07 var code = '\n<select name="search' + 08 fieldcount + '" onclick="add_input(' + 09 fieldcount + ', this)">' + '\n'; 10 for (i in fieldnames) { 11 code += '<option value="' + i + 12 '">' + fieldnames[i] + 13 '</option>\n'; 14 } 15 code += '</select>\n<span id="input_search' + 16 fieldcount + '"></span>'; 17 fieldcount++; 18 var newSpan=document.createElement('div'); 19 newSpan.innerHTML=code; 20 document.getElementById("inputarea"). 21 appendChild(newSpan); 22 } |
Using a JavaScript Event to Trigger Queries |
To handle queries, developers might add a publicSQL.query("select * from projects", "publicSQL.show"); The library supports a simplified SQL dialect for the query (see Table 1). The individual relations, /* Portable Table Format 0.9 */ porTables[porTables.length] = new Array("country", "sum", "activity", "Germany", 100000, "Tunnel building", "Belgium", 23456, "Promotional project", [...], "France", 76543, "Infrastructure", 3); The first two lines find the format; the third line contains the attribute headers. Other lines contain the comma-separated data pairs. The final line contains the number of attributes per line. This makes it really easy to convert JSON or CSV files to this format. Because a PTF file contains technically valid JavaScript, it need not necessarily have all the values of a tuple in a row. However, some additional tools outside of PublicSQL impose this condition. |
In the drop-down field, the user can select from a list of search fields defined in the code (Figure 1). Because the value
tag the field created in line 11 contains a numeric value between 0
and 4
. Clicking the drop-down field calls the onclick
handler with fieldcount
and a reference to the drop-down field that called it (this
) - all of this is pure theory so far because it is not until lines 18 through 21 that the HTML code is built into the page.
The createElement()
function starts by creating an empty div element; line 19 populates it with the content of code
. Following this, appendChild()
appends the content of the div inputarea
which was empty when the page was loaded. appendChild()
does not overwrite existing code, so you can add any number of lines with search parameters. The add_input()
event handler (Listing 3, line 17) adds the second column for a value parameter when selecting a search field. Unfortunately, this does not work with Konqueror because Konqueror's script engine ignores dynamically added event handlers.
Listing 3: Field of Search Parameters |
01 var input_code=[ 02 '', 03 '<select class="result" name="type"> 04 <option value="0">fruit</option> 05 <option value="1">vegetables</option> 06 </select><br/>', 07 '<input class="result" type="text" 08 name="color"><br/>', 09 '<input class="result" type="text" 10 name="taste"><br/>', 11 '<select class="result" name="size"> 12 <option value="0">small</option> 13 <option value="1">medium</option> 14 <option value="2">large</option> 15 </select><br/>' ]; 16 17 function add_input(n, e) { 18 document.getElementById('input_search' + n).innerHTML = 19 "\n = " + input_code[e.value]; 20 } |
add_input()
uses the n
line counter to find the correct input_search
n
div element. e
returns a reference to the left drop-down field, e.value
, which is the numeric value selected by clicking the field. On this basis, add_input()
the checks the code table in the input_code[]
array. To match the search file selected in the left column, this contains the HTML code for the right input field/search field - either another drop-down box or a text input field. The input field's name
attribute also matches the field name selected to the left. This completes the search form, which can be clicked together on the fly. Now it's time to look at the PublicSQL-based search form.
In contrast to forms sent to the server, where the data is automatically located in the POST
headers or GET
parameters, the program logic first has to parse the permanently open, client-side form in the DOM tree.
The document.form.elements
attribute, as seen in line 7 of Listing 4, returns the list of input elements in the form - unfortunately, this includes the buttons, which do not contain any data. Additionally, we only need the input fields from the right column. The name
and value
pair contains everything needed to query the database. Luckily, the HTML code in the input_code
in Listing 3 has thought of this and provides a result
class attribute filter criterion.
Listing 4: Constructing an SQL Query |
01 function execute_search(){ 02 var element; 03 var sql; 04 var firstloop = true; 05 06 sql = "SELECT * FROM plants WHERE "; 07 for (e in document.form.elements) { 08 element = document.form.elements[e]; 09 if (element) { 10 if (element.className == 'result') { 11 if (!firstloop) sql += " AND "; 12 firstloop = false; 13 sql += element.name + " = '" + 14 element.value + "'"; 15 } 16 } 17 } 18 sql += " ORDER BY type, size, name"; 19 publicSQL.query(sql, "show_result"); 20 } |
The for
loop in lines 7 through 17 of the execute_search
function now has no trouble filtering out the required data via the element.className
element. In Internet Explorer, the list of form elements (document.form.elements
) contains blank entries - nobody knows why, apart from Microsoft. To keep Internet Explorer happy, you thus need to query whether elements[e]
is set (lines 8, 9).
The loop in lines 7 through 17 creates the WHERE
clause for the SQL statement from the element.name
and element.value
contents of all the elements in the result
class. For a read-only database that stores its data client-side, SQL injection isn't an issue.
Insensitive to Complex Search Requests |
Joins or more complex queries had very little effect on the response times in our lab. However, developers have to come to terms with the fact that - faced with the scales encountered here - browsers might balk when asked to give JavaScript so much computational time in the case of PTF files with many data elements. Firefox even refused to create new objects. |
Line 11 (Listing 4) uses AND
to link the conditions. Line 6 stores the header for the query in the sql
variable; line 18 adds an ORDER BY
clause, and line 19 does the work: It calls PublicSQL with the resulting query as its first parameter. As the second parameter, the database engine expects the name of a callback function (show_result()
, see Listing 5), which it calls after a successful query with a results array as a parameter.
Listing 5: Display Result |
01 var types=["fruit", "vegetables"]; 02 var sizes=["small", "medium", "large",] 03 04 function show_result(result){ 05 var html = "\n<table>\n<tr><th>Name</th> 06 <th>Fruit or vegetables</th> 07 <th>Color</th> 08 <th>Taste</th> 09 <th>Size</th></th>"; 10 for (line =1; line < result.length; line++) { 11 html+="\n<tr>"; 12 html+="<td>"+result[line][1]+"</td>"; 13 html+="<td>"+types[result[line][2]]+"</td>"; 14 html+="<td>"+result[line][3]+"</td>"; 15 html+="<td>"+result[line][4]+"</td>"; 16 html+="<td>"+sizes[result[line][5]]+"</td>"; 17 html+="</tr>"; 18 } 19 html+="</table>"; 20 document.getElementById('result'). 21 innerHTML = html; 22 } number one |
In the result returned by PublicSQL, result[0][column 0 through column n
] contains the field names and result[n
] [column 0 through column x
] the data for the nth line. That is, show_result()
need only iterate over array elements starting with index 1
and distribute the columns over the cells of the table. Columns 2 and 5 do not contain the numeric values from the database. The corresponding long text comes from the types
and sizes
arrays (Listing 5, lines 1, 2). At line 20, JavaScript writes the HTML code to the position reserved in the static basic framework - the div
tag (Listing 1, line 9).
Listing 1: HTML Framework |
01 <head> 02 <meta http-equiv="Content-Type" content="text/html; charset=UTF-8"> 03 <script type="text/javascript" src="publicsql.js"></script> 04 </head> 05 <body> 06 Search for: 07 <form name="form" action=""> 08 <button onclick="add_searchfield(); return false">+ Searchcriterion</button> 09 <div id="inputarea"></div> 10 <button onclick="execute_search(); return false">Execute search</button> 11 </form> 12 Search result: 13 <div id="result"></div> 14 </body> |
The sample application gives you an idea of the service PublicSQL can give JavaScript programmers. To create a complex search function, you simply need to craft the matching SQL code rather than the complex queries in JavaScript. The same applies to interactive applications that select, recombine, and visualize data at run time. Having the library pass data to a callback function instead of returning them as an array or object is impractical. The search results are only available inside the callback function. If you try to bind the results array to a global variable to keep it available for the downstream program flow, the attempt tends to fail from a race condition.
The number of records is restricted: For tables with more than 10,000 entries, the Mozilla engine returns a Too many constructor Elements
error (see the "Performance" box); Internet Explorer can handle a few more. For queries against tables of this size with a resulting volume of no more than 100 records, the browser will respond without a noticeable delay.
Performance |
Obviously, PublicSQL can't achieve the same query speed as a classical database system. In a production environment, however, it does make sense to look at the bottom line. The time between clicking and the reaction in the browser is decisive for the user experience. Users will be happy to accept the fact that a page will take a couple of seconds to load once. |
A Quick Approach to Downloading Data |
In one or two seconds, an average DSL connection will let you download around 1MB of data, which can be a multiple of 10,000 records. However, if the latency of the network connection (especially in mobile Internet applications), the web server, its application, and the database back end is greater than that of the JavaScript applications running locally, the response for the user will be smoother than an online approach. The limits depend on many parameters and configurations. In a typical test setup, PublicSQL searched through a four-column table with more than 15,000 EU-funded projects in about 20 seconds. Developers should take into account that dynamically creating the HTML table will take up a large share of this time. |
INFO |
[1] PublicSQL: http://www.en.publicsql.org/
[2] Indexed database API: http://dvcs.w3.org/hg/IndexedDB/raw-file/tip/Overview.html [3] Sample application for this article: http://www.linux-magazine.com/Resources/Article-Code [4] Documentation: http://www.en.publicsql.org/documentation.htm |
AUTHOR |
Peter Kreußel has written numerous articles for Linux Magazine. He works as a freelance author and web developer and is particularly interested in semantic technologies and cross-media publishing. |