Chapter 5. Frequently Asked Question

This chapter provides you all FAQs related to the contents mentioned above.

5.1. JCR FAQ

It's the draft for a future FAQ of JCR usage.

5.1.1. Kernel

5.1.1.1. What is the best, standardized way to get the instance of a service ?

container.getComponentInstanceOfType(ServiceName.class);

5.1.2. JCR

5.1.2.1. JCR core

5.1.2.1.1. Is it better to use Session.getNodeByUUID or Session.getItem?

Session.getNodeByUUID() about 2.5 times faster of Session.getItem(String) and only 25% faster of Node.getNode(String). See the daily tests results for such comparisons, e.g.

http://tests.exoplatform.org/jcr.html

5.1.2.1.2. Does it make sense to have all the node referencable to use getNodeByUUID all the time?

Until it's applicable for a business logic it can be. But take in account the paths are human readable and lets you think in hierarchy. If it's important a location based approach is preferable.

5.1.2.1.3. What should I use to check if an Item exists before getting the Value?

Use Session.itemExists(String absPath), Node.hasNode(String relPath) or Property.hasProperty(String name). It's also is possible to check Node.hasNodes() and Node.hasProprties().

SELECT * FROM nt:unstructured WHERE jcr:path LIKE 'testRoot/%'

For specified jcr:path ordering there is different proceeding in XPath and SQL:

SQL no matter ascending or descending - query returns result nodes in random order: {code}SELECT * FROM nt:unstructured WHERE jcr:path LIKE 'testRoot/%' ORDER BY jcr:path{code}

XPath - jcr:path order construction is ignored (so result is not sorted according path); {code}/testRoot/* @jcr:primaryType='nt:unstructured' order by jcr:path{code}

5.1.2.1.8. How eXo JCR indexer uses content encoding?

1. Indexer uses jcr:encoding property of nt:resource node (used as jcr:content child node of nt:file) 2. if no jcr:encoding property set the Document Service will use the one configured in the service (defaultEncoding) 3. if nothing is configured a JVM, the default encoding will be used

5.1.2.1.9. Which database server is better for eXo JCR?

If the question is about the performance, it is difficult to answer, because each database database can be configured to have better performance in special case. According to the results of our internal tests the best choice is Oracle 11G R2 even when we store the binary data in the db, on other db it is recommended to store the binary data on the file system unless you have only small file content to store. MySQL and PostgreSQL also demonstrated in our benchmark results that they could provide good performance. DB2 and MSSQL are slower in default configurations. Default configuration of Sybase leader of slowness. But in this question, take the database server maintenance in account. MySQL and PostgreSQL are simple in installation and can work even on limited hardware. Oracle, DB2, MSSQL or Sybase need more efforts. The same actual for maintenance during the work. Note for Sybase: "check-sns-new-connection" data container configuration parameter should be set to "true". For testing purpose, embedded database such as HSQLDB is the best choice. Apache Derby and H2 also supported. But H2 surprisingly needs "beta" feature enabled - MVCC=TRUE in JDBC url.

5.1.2.1.10. How to setup eXo JCR for mutilingial content on MySQL?

To allow multiple character sets to be sent from the client, the UTF-8 encoding should be used, either by configuring utf8 as the default server character set, or by configuring the JDBC driver to use UTF-8 through the characterEncoding property. MySQL database should be created in single-byte encoding, e.g. "latin1":

CREATE DATABASE db1 CHARACTER SET latin1 COLLATE latin1_general_cs;

eXo JCR application (e.g. GateIn) should use JCR dialect "MySQL-UTF8".

In other words: MySQL database default encoding and JCR dialect cannot be UTF8 both. Use single-byte encoding (e.g. "latin1") for database and "mysql-utf8" dialect for eXo JCR.

Notice: "MySQL-UTF8" dialect cannot be auto-detected, it should be set explicitly in configuration.

5.1.2.1.11. Does MySQL have limitation affecting on eXo JCR features?

Index's key length of JCR_SITEM (JCR_MITEM) table for mysql-utf8 dialect is reduced to 765 bytes (or 255 chars).

5.1.2.1.12. Does use of Sybase database need special options in eXo JCR configuration?

To enable JCR working properly with Sybase, a property 'check-sns-new-connection' with 'false' value is required for each workspace data container:

<container class="org.exoplatform.services.jcr.impl.storage.jdbc.optimisation.CQJDBCWorkspaceDataContainer">
  <properties>
    <property name="source-name" value="jdbcjcr" />
    <property name="dialect" value="auto" />
    <property name="multi-db" value="true" />
    <property name="max-buffer-size" value="200k" />
    <property name="swap-directory" value="target/temp/swap/ws" />
    <property name="swap-directory" value="target/temp/swap/ws" />
    <property name="check-sns-new-connection" value="false" />
  </properties>

5.1.2.1.13. How to open and close a session properly to avoid memory leaks?

Session session = repository.login(credentials);
try
{
// here your code
}
finally
{
session.logout();
}

5.1.2.1.14. Can I use Session after loging out?

No. Any instance of Session or Node (acquired through session) shouldn't be used after loging out anymore. If you use Session or Node after logging out then you get an exception.

5.1.2.1.15. How to configure jcr for cluster ?

So we have configured JCR in standalone mode and want to reconfigure it for clustered environment. First of all, let's check whether all requirements are satisfied:

Dedicated RDBMS such as MySQL, Postges, Oracle and, etc but just not HSSQL;
Shared storage. The simplest thing is to use shared FS like NFS or SMB mounted in operation system, but they are rather slow. The best thing is to use SAN (Storage Area Network);
Fast network between JCR Cluster nodes.

So now, we need to configure the Container a bit. Check exo-configuration.xml to be sure that you are using JBossTS Transaction Service and Infinispan Transaction Manager, as shown below.

<component>
   <key>org.infinispan.transaction.lookup.TransactionManagerLookup</key>
   <type>org.exoplatform.services.transaction.infinispan.JBossStandaloneJTAManagerLookup</type>
</component>
   
<component>
  <key>org.exoplatform.services.transaction.TransactionService</key>
  <type>org.exoplatform.services.transaction.infinispan.JBossTransactionsService</type>
  <init-params>
    <value-param>
      <name>timeout</name>
      <value>3000</value>
    </value-param>
  </init-params>   
</component>

Next stage is actually the JCR configuration. We need Infinispan configuration templates for : data-cache, indexer-cache and lock-manager-cache. Later they will be used to configure JCR's core components. There are pre-bundled templates in EAR or JAR in conf/standalone/cluster. They can be used as is or re-written if needed. And now, re-configure a bit each workspace. Actually, a few parameters need changing, e.g. <cache>, <query-handler> and <lock-manager>.

<cache> configuration should look like this:

<cache enabled="true"
   class="org.exoplatform.services.jcr.impl.dataflow.persistent.infinispan.ISPNCacheWorkspaceStorageCache">
   <properties>
      <property name="infinispan-configuration" value="conf/standalone/cluster/test-infinispan-config.xml" />
      <property name="jgroups-configuration" value="udp-mux.xml" />
      <property name="infinispan-cluster-name" value="JCR-cluster" />
   </properties>
</cache>

"infinispan-configuration" is the path to configuration template;
"jgroups-configuration" is path to JGroups configuration that relies on JGroups shared transport.
"infinispan-cluster-name" is the name of cluster group.

<query-handler> configuration

You must replace or add in the <query-handler> block, the "changesfilter-class" parameter equals with:

<property name="changesfilter-class" value="org.exoplatform.services.jcr.impl.core.query.ispn.ISPNIndexChangesFilter"/>

add Infinispan-oriented configuration:

<property name="infinispan-configuration" value="conf/standalone/cluster/test-infinispan-indexer.xml" />
<property name="jgroups-configuration" value="udp-mux.xml" />
<property name="infinispan-cluster-name" value="JCR-cluster" />
<property name="max-volatile-time" value="60" />

Those properties have the same meaning and restrictions as in the previous block. The last property "max-volatile-time" is not mandatory but recommended. This notifies that the latest changes in index will be visible for each cluster node not later than in 60s.

<lock-manager> configuration
Maybe this is the hardest element to configure, because we have to define access to DB where locks will be stored. Replace exsiting lock-manager configuration with the next one:
```
<lock-manager class="org.exoplatform.services.jcr.impl.core.lock.infinispan.ISPNCacheableLockManagerImpl">
   <properties>
      <property name="time-out" value="15m" />
      <property name="infinispan-configuration" value="conf/standalone/cluster/test-infinispan-lock.xml" />
      <property name="jgroups-configuration" value="udp-mux.xml" />
      <property name="infinispan-cluster-name" value="JCR-cluster" />
      <property name="infinispan-cl-cache.jdbc.table.name" value="lk" />
      <property name="infinispan-cl-cache.jdbc.table.create" value="true" />
      <property name="infinispan-cl-cache.jdbc.table.drop" value="false" />
      <property name="infinispan-cl-cache.jdbc.id.column" value="id" />
      <property name="infinispan-cl-cache.jdbc.data.column" value="data" />
      <property name="infinispan-cl-cache.jdbc.timestamp.column" value="timestamp" />
      <property name="infinispan-cl-cache.jdbc.datasource" value="jdbcjcr" />
      <property name="infinispan-cl-cache.jdbc.dialect" value="${dialect}" />
      <property name="infinispan-cl-cache.jdbc.connectionFactory" value="org.exoplatform.services.jcr.infinispan.ManagedConnectionFactory" />
   </properties>
</lock-manager>
```
First few properties are the same as in the previous components, but here you can see some strange "infinispan-cl-cache.jdbc.*" properties. They define access parameters to the database where lock are persisted.

That's all. The JCR is ready to join a cluster.

5.1.2.1.16. How to use lucene spellchecker?

There is few steps:

Enable lucene spellchecker in jcr QueryHandler configuration:

<query-handler class="org.exoplatform.services.jcr.impl.core.query.lucene.SearchIndex">
   <properties>
      ...
      <property name="spellchecker-class" value="org.exoplatform.services.jcr.impl.core.query.lucene.spell.LuceneSpellChecker$FiveSecondsRefreshInterval" />
      ...
   </properties>
</query-handler>

Execute query with rep:spellcheck function and word that is checked:

Query query = qm.createQuery("select rep:spellcheck() from nt:base where " + "jcr:path = '/' and spellcheck('word that is checked')", Query.SQL);
RowIterator rows = query.execute().getRows();

Fetch a result:

Row r = rows.nextRow();
Value v = r.getValue("rep:spellcheck()");

If there is no any results, that means there is no suggestion, so word is correct or spellcheckers dictionary do not contain any words like the checked word.

5.1.2.1.17. How can I affect to spellchecker results?

There is two parameters in jcr QueryHandler configuration:

Minimal distance between checked word and proposed suggestion;

Search for more popular suggestions;

<query-handler class="org.exoplatform.services.jcr.impl.core.query.lucene.SearchIndex">
   <properties>
      ...
      <property name="spellchecker-class" value="org.exoplatform.services.jcr.impl.core.query.lucene.spell.LuceneSpellChecker$FiveSecondsRefreshInterval" />
      <property name="spellchecker-more-popular" value="false" />
      <property name="spellchecker-min-distance" value="0.55" />
      ...
   </properties>
</query-handler>

Minimal distance is counted as Levenshtein distance between checked word and spellchecker suggestion.

If the proposed word exists in the directory - no suggestion given;
If the proposed word doesn't exist in the directory - propose the closed word;

If "morePopular" enabled:

No matter word exists or not, checker will propose the closed word that is more popular than the checked word.

5.1.2.2. JCR extensions

5.1.2.2.1. How to restore repository to existing repository ?

Remove existing repository, use:

RepositoryService.removeRepository(String repositoryName)

Restore repository, use

BackupManager.restore(RepositoryBackupChainLog log, RepositoryEntry repositoryEntry, boolean asynchronous)

5.1.2.2.2. How to restore workspace to existing worksapce?

Remove existing workspace, use:

ManageableRepository.removeWorkspace(String workspaceName)

Restore workspace, use:

BackupManager.restore(BackupChainLog log, String repositoryName, WorkspaceEntry workspaceEntry, boolean asynchronous)

5.1.2.2.3. Does JCR support hot backup?

Yes, JCR is support hot backup. Will use org.exoplatform.services.jcr.ext.backup.BackupManager.

5.1.2.3. WebDAV

5.1.2.3.1. I uploaded a file to WebDAV server using Mac OS Finder, but the file size is '0', what is wrong ?

This is known as a finder bug started from Mac OS v.10.5.3 and not yet fixed, .

For more details follow:  Apple Disscussion thread.

5.1.2.3.2. Can I manage 'cache-control' value for different media-types from server configuration ?

Use "cache-control" configuration parameter.

The value of this parameter must contain colon-separated pairs "MediaType:cache-control value"

For example, if you need to cache all text/xml and text/plain files for 5 minutes (300 sec.) and other text/\* files for 10 minutes (600 sec.), use the next configuration:

<component>
   <type>org.exoplatform.services.jcr.webdav.WebDavServiceImpl</type>
   <init-params>
      <value-param>
         <name>cache-control</name>
         <value>text/xml,text/plain:max-age=300;text/*:max-age=600;</value>
      </value-param>
   <init-params>
<component>

5.1.2.3.3. How to perform WebDAV requests using curl ?

Simple Requests

For simple request such as: GET, HEAD, MKCOL, COPY, MOVE, DELETE, CHECKIN, CHECKOUT, UNCHECKOUT, LOCK, UNLOCK, VERSIONCONTROL, OPTIONS

perform:

curl -i -u 'user:pass' -X 'METHOD_NAME' 'resource_url'

for example to create a folder named test perform:

curl -i -u 'root:exo' -X MKCOL 'http://localhost:8080/rest/jcr/repository/production/test

to PUT a test.txt file from your current folder to "test "folder on server perform:

curl -i -u 'root:exo' -X PUT 'http://localhost:8080/rest/jcr/repository/production/test/test.txt' -d @test.txt

Requests with XML body

For requests which contains xml body such as: ORDER, PROPFIND, PROPPATCH, REPORT, SEARCH

add -d 'xml_body text' or -d @body.xml

(body.xml must contain a valid xml request bidy.) to you curl-command:

curl -i -u 'user:pass'  -X 'METHOD_NAME' -H 'Headers' 'resource_url' -d 'xml_body text'

For example about finding all files containing "test" perform:

curl -i -u "root:exo" -X "SEARCH" "http://192.168.0.7:8080/rest/jcr/repository/production/" -d
"<?xml version='1.0' encoding='UTF-8' ?>
   <D:searchrequest xmlns:D='DAV:'>
      <D:sql>SELECT * FROM nt:base WHERE contains(*, 'text')</D:sql>
</D:searchrequest>"

If you need to add some headers to your request, use \-H key.

To have more information about methods parameters, you can find in HTTP Extensions for Distributed Authoring specification.

5.1.2.3.4. How eXo JCR WebDAV server treats content encoding?

OS client (Windows, Linux etc) doesn't set an encoding in a request. But eXo JCR WebDAV server looks for an encoding in a Content-Type header and set it to jcr:encoding. See http://www.w3.org/Protocols/rfc2616/rfc2616-sec14.html, 14.17 Content-Type. e.g. Content-Type: text/html; charset=ISO-8859-4 So, if a client will set Content-Type header, e.g. JS code from a page, it will works for a text file as expected.

If WebDAV request doesn't contain a content encoding, it's possible to write a dedicated action in a customer application. The action will set jcr:encoding using its own logic, e.g. based on IP or user preferences.