Skip to main content

Notice: this Wiki will be going read only early in 2024 and edits will no longer be possible. Please see: https://gitlab.eclipse.org/eclipsefdn/helpdesk/-/wikis/Wiki-shutdown-plan for the plan.

Jump to: navigation, search

Difference between revisions of "SMILA/FAQ"

(I implemented/deployed a OSGi Service in a new bundle but SMILA log says that it couldn't be found)
m (How can I see that SMILA started correctly?)
(17 intermediate revisions by 5 users not shown)
Line 29: Line 29:
 
=== How can I see that SMILA started correctly? ===
 
=== How can I see that SMILA started correctly? ===
  
You should see no stacktraces in the log ;) and it should end with an entry like that if SMILA has just started:
+
1. Open your browser at <code>http://localhost:8080/smila</code>, you should see a system state overview and links to different APIs.
 +
 
 +
2. You should see no stacktraces in the log ;) and it should end with an entry like the following if SMILA has just started:
  
 
<pre>
 
<pre>
Line 119: Line 121:
 
|}
 
|}
  
For the complete documentation on eclipse runtime options please see: http://help.eclipse.org/ganymede/index.jsp?topic=/org.eclipse.platform.doc.isv/reference/misc/runtime-options.html
+
For the complete documentation on eclipse runtime options please see: http://help.eclipse.org/indigo/index.jsp?topic=/org.eclipse.platform.doc.isv/reference/misc/runtime-options.html
  
 
=== Bundles ===
 
=== Bundles ===
  
==== Launching crawler bundles - How to solve: "Could not find crawler id" error message? ====
+
==== new bundle was not started  ====
  
While launching SMILA I receive the following error message:
+
After launching SMILA my new bundle doesn't seem to be started.
  
<code>Could not find crawler id</code>
+
If you started '''<tt>SMILA.launch</tt>''' in eclipse to launch SMILA: The launcher didn't start your new bundle. Try this:
  
If you started '''<tt>SMILA.launch</tt>''' to launch SMILA: The launcher didn't start your new crawler bundle. Try this:
+
# Add your bundle by selecting "Run Configurations" in eclipse and choose your SMILA profile.
 
+
# Add your bundle by selecting "Open Run dialog" in eclipse and choose your SMILA profile.
+
 
# Select your bundle in the list and set the checkmark.
 
# Select your bundle in the list and set the checkmark.
 
# Set the start level to "4" and the autostart to "true".
 
# Set the start level to "4" and the autostart to "true".
Line 138: Line 138:
  
 
<ol>
 
<ol>
<li>Open the file <tt>config.ini</tt> and add your bundle as shown below:<br>
+
<li>Open the file <tt>configuration/config.ini</tt> and add your bundle as shown below:<br>
 
<code>org.eclipse.smila.connectivity.framework.crawler.owncrawler@5:start, \</code></li>
 
<code>org.eclipse.smila.connectivity.framework.crawler.owncrawler@5:start, \</code></li>
 
<li>Open the <tt>build.properties</tt> file of your bundle and include the folders <tt>schemas/</tt>, <tt>OSGI-INF/</tt>, and the file <tt>plugin.xml</tt>.</li>
 
<li>Open the <tt>build.properties</tt> file of your bundle and include the folders <tt>schemas/</tt>, <tt>OSGI-INF/</tt>, and the file <tt>plugin.xml</tt>.</li>
 
</ol>
 
</ol>
 
==== I changed the implementation of a bundle, deployed it and restarted SMILA, but SMILA still seems to use the old bundle ====
 
 
Close SMILA, delete the following directories in your configuration folder and restart SMILA again:
 
* <tt>org.eclipse.core.runtime/</tt>
 
* <tt>org.eclipse.equinox.app/</tt>
 
* <tt>org.eclipse.osgi/</tt>
 
* <tt>org.eclipse.update/</tt>
 
  
 
==== I changed bundle settings in my <tt>config.ini</tt>, but after SMILA restart nothing changed ====
 
==== I changed bundle settings in my <tt>config.ini</tt>, but after SMILA restart nothing changed ====
Line 158: Line 150:
  
  
=== JConsole ===
+
== Configuring/Running SMILA ==
  
==== Why is the ''SMILA'' package not in the JConsole tree? ====
+
=== Basics ===
  
I've started <tt>SMILA.exe</tt> but the SMILA package isn't in the tree of JConsole.
+
==== How do I change the HTTP port of SMILA? ====
  
To solve this try the following:
+
The HTTP port is configured in fie <tt>configuration/org.eclipse.smila.clusterconfig.simple/clusterconfig.json</tt>. Change it there and restart SMILA.
  
# Create a new connection.
+
See [[SMILA/Documentation/JettyHttpServer#Setting_the_HTTP_port]] for details.
# Change your connection by setting the port "9004" on the ''Remote'' tab.
+
# Click the ''Connect'' button, switch to the ''MBeans'' tab, and check the tree again.
+
 
+
==== Why is the ''SMILA/CrawlerController'' MBean not in the JConsole tree? ====
+
 
+
Check if all needed bundles are active. Open the equinox console and type:
+
* <tt>ss crawler</tt>
+
* <tt>ss deltaindexing.impl</tt>
+
 
+
If one of these is not active:
+
* check the <tt>configuration/config.ini</tt>
+
* check the log file for errors
+
 
+
==== Cannot connect with JConsole to a remote machine running Ubuntu 8.04 or newer ====
+
 
+
For some reason the JConsole cannot make a '''''remote''''' connection to JVM running on Ubuntu 8.04 or newer installations. (We did some tests with SuSE Linux and had no problems.)
+
 
+
This can be easily circumvented by modifying <tt>/etc/hosts</tt> file.
+
Simply replace <tt>127.0.1.1</tt> with the real IP address of your Ubuntu machine and you're ready to go.
+
 
+
For example, if you have a line in /etc/hosts that looks like this:
+
 
+
<tt>127.0.1.1 jupiter</tt>        (Where "jupiter" it the actual name of the Ubuntu machine in this example. You will almost certainly have some other name here ;-)
+
 
+
then replace it with:
+
 
+
<tt>192.168.220.101 jupiter</tt>  (Where 192.168.220.101 is the actual IP address of the Ubuntu machine in this example.)
+
 
+
== Configuring/Running SMILA ==
+
  
 
=== Crawler ===
 
=== Crawler ===
Line 205: Line 168:
 
Check the state of the crawl job in [http://localhost:8080/smila/jobmanager/jobs jobmanager state API]
 
Check the state of the crawl job in [http://localhost:8080/smila/jobmanager/jobs jobmanager state API]
 
* if the state in your crawl job is "RUNNING", check the mode it has been started
 
* if the state in your crawl job is "RUNNING", check the mode it has been started
** if it says {{code|"runMode": "STANDARD"}}, stop the job run ({{code|POST http://localhost:8080/smila/jobmanager/jobs/<job name>/<job run id>/finish}}) and start it again with mode "runOnce", e.g. {{code|POST http://localhost:8080/smila/jobmanager/jobs/<job name> {"mode": "runOnce"}}}
+
** if it says {{code|"mode": "STANDARD"}}, stop the job run ({{code|POST http://localhost:8080/smila/jobmanager/jobs/<job name>/<job run id>/finish}}) and start it again with mode "runOnce", e.g. {{code|POST http://localhost:8080/smila/jobmanager/jobs/<job name> {"mode": "runOnce"}}}
 
* if the state is FINISHING
 
* if the state is FINISHING
 
** check the job into which the crawl job pushes its records for processed records or errors.
 
** check the job into which the crawl job pushes its records for processed records or errors.
Line 211: Line 174:
 
** just wait some time, the autocommit handler maybe hasn't yet committed the data to the Solr index
 
** just wait some time, the autocommit handler maybe hasn't yet committed the data to the Solr index
 
** check {{code|autoCommit}} settings in the <tt>solrconfig.xml</tt> file of your Solr core.
 
** check {{code|autoCommit}} settings in the <tt>solrconfig.xml</tt> file of your Solr core.
 
==== I tried to crawl/index a data source (with the deprecated connectivity framework) to a job, JConsole says "Crawler ... sucessfully started" but nothing seems to happen ====
 
 
Check the SMILA log file for errors.
 
 
Check if your specified job name exists and is running:
 
# Open a browser and go to [http://localhost:8080/smila/jobmanager/jobs jobmanager state API]
 
# browse to your specified job and check if there is an active job run (state: RUNNING)
 
# if so, check the job run details
 
 
If you find no errors in the log and your job is running but doesn't seem to do something:
 
# Did you crawl the data source before? If so, try a <code>clearAll</code> on the DeltaIndexing-MBean in your JConsole
 
# Check your crawler configuration file (see <tt>configuration/org.eclipse.smila.connectivity.framework/</tt>)
 
  
 
=== Pipeline ===  
 
=== Pipeline ===  
Line 232: Line 182:
 
: <tt>configuration/org.eclipse.smila.processing.bpel/processor.properties</tt>
 
: <tt>configuration/org.eclipse.smila.processing.bpel/processor.properties</tt>
 
See entry: <code>pipeline.timeout</code>
 
See entry: <code>pipeline.timeout</code>
 +
 +
==== My xpath expression doesnt work ====
 +
# Check that the expression is correct with a XPath tool
 +
# ensure that the attributes are not filtered out, see [[SMILA/Documentation/HowTo/How_to_filter_and_access_record_data_in_BPEL]]
 +
 +
'''Helpful log points'''
 +
{{CTable}}
 +
| Area || id || level || Comment
 +
|-
 +
| BPEL || org.eclipse.smila.processing.bpel.PipeletManager  || debug || Log BPEL request XML before and after pipelet invocation. Note: this contains the XML for the records but only their filtered version.
 +
|-
 +
| BPEL || org.eclipse.smila.processing.bpel.PipeletManager  || trace|| Log full XML of records after pipelet invocation. Note: this is the real thing unfiltered.
 +
|}
  
 
== Implementing Pipelets / OSGi Services / Bundles ==
 
== Implementing Pipelets / OSGi Services / Bundles ==
Line 242: Line 205:
 
# <tt><SMILA>/configuration/<bundle-name>/<config-file></tt>
 
# <tt><SMILA>/configuration/<bundle-name>/<config-file></tt>
 
# <tt><config-file></tt> in the root path of the bundle jar-file
 
# <tt><config-file></tt> in the root path of the bundle jar-file
 +
 +
See [[SMILA/Project_Concepts/Simple_configuration_handler|Configuration Handler]] for more information.
  
 
=== Deploy / Launch ===
 
=== Deploy / Launch ===
Line 283: Line 248:
 
Thamks to Bogdan Sacaleanu for the solution. See this [http://smila.markmail.org/thread/sj4vhcikq2wndtdp thread in the smila-dev mailing list] for additional details.
 
Thamks to Bogdan Sacaleanu for the solution. See this [http://smila.markmail.org/thread/sj4vhcikq2wndtdp thread in the smila-dev mailing list] for additional details.
  
 +
==== I get classloading errors when I try to access an external Web Service using JAX-WS ====
 +
 +
Class loading problems often occur when using third party libraries that use the "thread context classloader" in OSGi, and standard implementations of Java specifications by Sun/Oracle (or other non-OSGi-aware parties...) do this very often.
 +
 +
The only solution we currently know of is to wrap the critical section (in this case it's the construction of the webservice client class) in a piece of code like this:
 +
 +
<source lang="java">
 +
ClassLoader tccl = Thread.currentThread().getContextClassLoader();
 +
try {
 +
    Thread.currentThread().setContextClassLoader(getClass().getClassLoader());
 +
    /*
 +
    * initialize the client
 +
    */
 +
} finally {
 +
    Thread.currentThread().setContextClassLoader(tccl);
 +
}
 +
</source>
 +
 +
See also [http://www.eclipse.org/forums/index.php/t/266362/ this forum thread].
 +
 +
==== More classloader errors - part 1 ====
 +
 +
If you get an error like this:
 +
 +
<source lang="text">
 +
Caused by: java.lang.LinkageError: loader constraint violation: when resolving field "DATETIME" the class loader (instance of
 +
org/eclipse/osgi/internal/baseadaptor/DefaultClassLoader) of the referring class, javax/xml/datatype/DatatypeConstants, and the class loader (instance of <bootloader>) for the field's resolved type, javax/xml/namespace/QName, have different Class objects for that type
 +
</source>
 +
 +
the reason is most likely that you have classes in the bundle classpath of your bundle that contains classes which are also part of the JDK runtime library (java.*, javax.*). Remove these classes from the bundle and it should work.
  
 +
See also [http://www.eclipse.org/forums/index.php/t/266362/ this forum thread].
  
 
[[Category:SMILA]]
 
[[Category:SMILA]]

Revision as of 06:38, 16 November 2012

This pages contains the frequently asked questions of the SMILA project.

General Hint: When you have problems during a SMILA launch / run, please have a look at the SMILA log file first. (<SMILA>/SMILA.log)

General

Where do I find the log file?

The SMILA log file's name is SMILA.log and is to be found at the applications working folder:

If you extracted SMILA from a zip, you can find the log file in the folder, where you extracted the SMILA.

/<SMILA>
  /about_files
  /configuration
  /features
  /jmxclient
  /plugins
  /workspace
  .eclipseproduct
  ...
  SMILA
  SMILA.ini
  -> SMILA.log <-

If you started SMILA from within the Eclipse IDE using the launcher, you can find the log file at the project SMILA.application in your workspace.

How can I see that SMILA started correctly?

1. Open your browser at http://localhost:8080/smila, you should see a system state overview and links to different APIs.

2. You should see no stacktraces in the log ;) and it should end with an entry like the following if SMILA has just started:

... INFO  ... internal.HttpServiceImpl    - HTTP server started successfully on port 8080.

Building SMILA

I receive an Out of Memory error? What can I do?

While building with SMILA.builder I receive the following errror message:

Build Failed - Out of Memory - Java heap space

The reason for this is that Ant hasn´t enough heap space to build the project. You will have to expand the heap space by setting the VM arguments accordingly. In eclipse try the following:

  1. Click Open external tools dialog and select your Ant build profile.
  2. Switch to the JRE tab and add the following VM arguments: -Xms40m -Xmx512m.
  3. Save and build again.


Build fails with 'java.net.MalformedURLException: no protocol: ${eclipseBaseURL}' message

The complete error looks like this:

...
build.core:
     [java] Buildfile: /home/scum36/Eclipse/helios-sr2-build/plugins/org.eclipse.pde.build_3.6.2.R36x_20110203/scripts/productBuild/productBuild.xml
     [java] 
     [java] main:
     [java] 
     [java] preBuild:
     [java] 
     [java] preSetup:
     [java] 
     [java] checkLocalMaps:
     [java] 
     [java] getMapFiles:
     [java]       [cvs] Caught exception: Cannot run program "cvs" (in directory "/home/scum36/workspace/SMILA/smila.build/maps"): java.io.IOException: error=2, No such file or directory
     [java] 
     [java] postSetup:
     [java] 
     [java] checkLocalBase:
     [java] 
     [java] getBaseComponents:
     [java] 
     [java] BUILD FAILED
     [java] /home/scum36/Eclipse/helios-sr2-build/plugins/org.eclipse.pde.build_3.6.2.R36x_20110203/scripts/productBuild/productBuild.xml:33: The following error occurred while executing this line:
     [java] /home/scum36/Eclipse/helios-sr2-build/plugins/org.eclipse.pde.build_3.6.2.R36x_20110203/scripts/build.xml:53: The following error occurred while executing this line:
     [java] /home/scum36/workspace/SMILA/SMILA.builder/customTargets.xml:80: The following error occurred while executing this line:
     [java] /home/scum36/workspace/SMILA/SMILA.builder/customTargets.xml:25: java.net.MalformedURLException: no protocol: ${eclipseBaseURL}
     [java] 
     [java] Total time: 0 seconds
     [java] An error has occurred. See the log file
     [java] /home/scum36/workspace/.metadata/.log.


Probably you did forget to copy build.properties.template to build.properties in SMILA.builder and adapt it. See How to build SMILA.

Launching SMILA

Linux

How to start/stop and manage SMILA as a background process on a Linux machine?

Since the default configuration (stored in SMILA.ini) of the OSGi runtime (in our case Equinox) launcher expects that you execute it in foreground and therefore have an OSGi console running in your shell and listening to the standard input, the first thing we have to do is to advise the launcher (and thereby Equinox) to listen on some TCP port instead. This is done by adding a new line with the port number just after the "-console" line.

For example, to set console to listen at TCP port 9999, SMILA.ini would look like this:

-console
9999
...

Now, after SMILA has been started with “$ nohup ./SMILA &”, the console can be accessed from any computer simply by opening a telnet session:

$ telnet <smila_host_name> <console_port>


Symbol comment vote.png If you are logged in via telnet and just want to close this connection and not to stop Equinox running SMILA, than just type “disconnect”. Otherwise, if you want to stop SMILA and close the connection, use “close” - as you normally would do on a console running in your shell.

For the complete documentation on eclipse runtime options please see: http://help.eclipse.org/indigo/index.jsp?topic=/org.eclipse.platform.doc.isv/reference/misc/runtime-options.html

Bundles

new bundle was not started

After launching SMILA my new bundle doesn't seem to be started.

If you started SMILA.launch in eclipse to launch SMILA: The launcher didn't start your new bundle. Try this:

  1. Add your bundle by selecting "Run Configurations" in eclipse and choose your SMILA profile.
  2. Select your bundle in the list and set the checkmark.
  3. Set the start level to "4" and the autostart to "true".

If you started SMILA.EXE to launch SMILA: Your bundle isn`t defined in config.ini or the start level isn´t correct. Try this:

  1. Open the file configuration/config.ini and add your bundle as shown below:
    org.eclipse.smila.connectivity.framework.crawler.owncrawler@5:start, \
  2. Open the build.properties file of your bundle and include the folders schemas/, OSGI-INF/, and the file plugin.xml.

I changed bundle settings in my config.ini, but after SMILA restart nothing changed

Check your config.ini for unusual whitespaces (e.g. a tab) between the (edited) bundle entries or after the backslashes on line ends - and remove them.

If that doesn't help, see question (resp. answer) before.


Configuring/Running SMILA

Basics

How do I change the HTTP port of SMILA?

The HTTP port is configured in fie configuration/org.eclipse.smila.clusterconfig.simple/clusterconfig.json. Change it there and restart SMILA.

See SMILA/Documentation/JettyHttpServer#Setting_the_HTTP_port for details.

Crawler

I started a crawl job, the job was started successfully, but nothing seems to happen

Check the SMILA log file for errors.

Check the state of the crawl job in jobmanager state API

  • if the state in your crawl job is "RUNNING", check the mode it has been started
  • if the state is FINISHING
    • check the job into which the crawl job pushes its records for processed records or errors.
  • if everything up to now is OK
    • just wait some time, the autocommit handler maybe hasn't yet committed the data to the Solr index
    • check autoCommit settings in the solrconfig.xml file of your Solr core.

Pipeline

Why do I get a timeout exception during a (long running) pipeline execution?

In SMILA there's a timeout configured for a pipeline execution:

configuration/org.eclipse.smila.processing.bpel/processor.properties

See entry: pipeline.timeout

My xpath expression doesnt work

  1. Check that the expression is correct with a XPath tool
  2. ensure that the attributes are not filtered out, see SMILA/Documentation/HowTo/How_to_filter_and_access_record_data_in_BPEL

Helpful log points

Area id level Comment
BPEL org.eclipse.smila.processing.bpel.PipeletManager debug Log BPEL request XML before and after pipelet invocation. Note: this contains the XML for the records but only their filtered version.
BPEL org.eclipse.smila.processing.bpel.PipeletManager trace Log full XML of records after pipelet invocation. Note: this is the real thing unfiltered.

Implementing Pipelets / OSGi Services / Bundles

Configuration

I want to use the ConfigUtils class in my Pipelet to read the configuration, where do I have to put my configuration files?

Configuration files are searched for in the following order:

  1. <SMILA>/configuration/<bundle-name>/<config-file>
  2. <config-file> in the root path of the bundle jar-file

See Configuration Handler for more information.

Deploy / Launch

I implemented/deployed a OSGi Service in a new bundle but SMILA log says that it couldn't be found

Check your new bundle, it should contain a file like that:

  • OSGI-INF/<myService>.xml

In this file your new service has to be referenced. If you have copied the file from some other service, be sure to change the component name in the root element to something unique, because DS does not start multiple services with the same component name.

  • <component name="<myService>" immediate="true">

Also the file has to be referenced from the MANIFEST.MF file of your bundle as a service component:

  • Service-Component: OSGI-INF/<myService>.xml

Also, you may need to include Import-Package: declarations for super-classes of your service implementation class even if there are no compile errors.

On the "Build" page of the manifest editor, you must add the OSGI-INF directory to the binary build.

And finally, your bundle has to be started at SMILA launch, e.g. by adding it to the config.ini.

If you are using SMILA.launch to launch SMILA, you also have to open the run/debug configuration of SMILA, check the new bundle and set Auto-Start to "true".

I get classloading errors in invocations of my own Pipelet when running SMILA outside the IDE. In the IDE it works

The error could look like this:

2010-11-19 11:28:36,101 ERROR [ODEServerImpl-1 ] vpu.JacobVPU - Method "run" in class "org.apache.ode.bpel.rtrep.v2.EXTENSIONACTIVITY" threw an unexpected exception. java.lang.LinkageError: loader constraint violation: loader (instance of org/eclipse/osgi/internal/baseadaptor/DefaultClassLoader) previously initiated loading for a different type with name "org/w3c/dom/Document"

We are not completely sure, why this happens, but a solution is to set this system property in the SMILA.ini file:

-Dosgi.java.profile.bootdelegation=override

Thamks to Bogdan Sacaleanu for the solution. See this thread in the smila-dev mailing list for additional details.

I get classloading errors when I try to access an external Web Service using JAX-WS

Class loading problems often occur when using third party libraries that use the "thread context classloader" in OSGi, and standard implementations of Java specifications by Sun/Oracle (or other non-OSGi-aware parties...) do this very often.

The only solution we currently know of is to wrap the critical section (in this case it's the construction of the webservice client class) in a piece of code like this:

ClassLoader tccl = Thread.currentThread().getContextClassLoader();
try {
    Thread.currentThread().setContextClassLoader(getClass().getClassLoader());
    /*
     * initialize the client
     */
} finally {
    Thread.currentThread().setContextClassLoader(tccl);
}

See also this forum thread.

More classloader errors - part 1

If you get an error like this:

Caused by: java.lang.LinkageError: loader constraint violation: when resolving field "DATETIME" the class loader (instance of 
 org/eclipse/osgi/internal/baseadaptor/DefaultClassLoader) of the referring class, javax/xml/datatype/DatatypeConstants, and the class loader (instance of <bootloader>) for the field's resolved type, javax/xml/namespace/QName, have different Class objects for that type

the reason is most likely that you have classes in the bundle classpath of your bundle that contains classes which are also part of the JDK runtime library (java.*, javax.*). Remove these classes from the bundle and it should work.

See also this forum thread.

Back to the top