Frequently Asked Questions (FAQ) and troubleshooting tips are included below regarding Aspire Connectors and Content Processing. Please note that, when available, each of the Aspire Publishers has a subsection for FAQ & Troubleshooting.
If you have questions or wish to provide feedback, contact the Aspire Support group at [email protected].
Both Aspire and MongoDB are high memory consuming applications. Consider limiting the RAM they can allocate by looking into the MongoDB storage engine configuration and setting the Wired Tiger Cache size appropriately. More Info: For example, on a 32GB server, the Aspire Memory allocation should be limited to 16GB, Wired Tiger Cache to 8GB and the remaining RAM will be used for the other MongoDB processes and the OS. JVM changes the pointers' size after 32GB, doubling them and affecting your RAM storage. More info here. Our recommendation is to try to stay below the 32GB limit. The number of available threads/processes and open files are unlimited just for the root user. If you plan to set another user, check that the limits on both numbers. Check here for more information on how to do this. Recommended limit is 64000 for both open files and processes.Do you plan to have MongoDB and Aspire installed on the same machine?
Do you plan to assign more than 32GB of RAM to your Aspire Distribution?
Do you plan to run your Aspire Distribution on Linux with a user other than root?
Custom connector/applications loaded from xml files needs to be on the server, because zookeeper can not keep track of all files uploaded in this way, and the fact that we want to encourage the use of appbundles, xml files are not synchronized by zookeeper.
If you install a connector/applications in this way, you are informed that this connector/applications will be available only in the server installed, if you are in a synchronized environment, you must manually install the same connector/applications on each of the servers.
The trick here is changing the package name in multiple places. It is easiest to let Eclipse handle most of it (via Refactor->Rename).
Note that your JAR may still include a com.searchtechnologies section AS WELL AS your new path because Aspire includes com.searchtechnologies.aspire.framework in every distribution.
Now the project that uses the above JAR needs to be adjusted for its new location:
<component name="customComponent" subType="default" factoryName="com.newgroupid:aspire-custom-component"/>
In the Aspire config file that references your component, change the factory name; for example:
8. In the POM for the enclosing project, find the dependency block for your component and change its groupId:
<dependency> <groupId>com.newgroupid</groupId> <artifactId>aspire-custom-component</artifactId> <version>0.4-SNAPSHOT</version> <scope>jar</scope> </dependency>
9. In the distribution.xml file for the enclosing project, make sure that there is a dependencySet for the new package:
<dependencySet> <outputFileNameMapping>${artifact.artifactId}-${artifact.baseVersion}${dashClassifier?}.${artifact.extension}</outputFileNameMapping> <unpack>false</unpack> <useTransitiveDependencies>false</useTransitiveDependencies> <outputDirectory>bundles/aspire</outputDirectory> <includes> <include>com.newgroupid:*</include> </includes> </dependencySet>
Things to try:
(within Eclipse)
The error is:
Can't rename C:\Documents and Settings\Admin\Desktop\workspaces\aspire01\maven.1260664537843
You will need to delete the stray "maven.1260664537843" folder in your workspace and try again. This might need to be repeated a few times before it works.
Check out the Subversion Console and the Maven Console for more details.
Try to avoid using metadata tags which start with "<rss" or "<feed", e.g., <feederLabel>.
Basically, IE looks for certain tags at the beginning of a random XML file to determine if it's an RSS feed. If it is, it may do something special with it.
When using an RSS 2.0, 0.91, or 0.92 feed, the HTTP Content-Type header should be “text/xml”:
Content-Type: text/xml
Because this is a generic content-type, IE7 will scan the first 512 bytes to look for the “<rss” string that indicates that it is a feed. If the string is found, the file is considered a feed, and the feed preview is shown to the user. For example, this is a RSS 2.0 feed for the IE blog, and IE7 determines that it is a feed because of the highlighted string.
<?xml version="1.0" encoding="UTF-8" ?> <rss version="2.0" ... >
IE may then report that the XML file can not be parsed as an RSS feed (or some such error).
When running under just the component test bench, everything worked fine, but when running under the application test bench, a third-party JAR is unable to locate its own properties file.
Sometimes, third-party JARs use wacky methods for loading classes. In the case of the ROME libraries, do the following under OSGi in order for the ROME JARr to locate its own properties files:
// Reset the classloader for this thread to prevent issues finding the ROME properties file // and classes if (!Thread.currentThread().getContextClassLoader().equals(SyndFeedInput.class.getClassLoader())) { System.out.println("Setting ClassLoader"); Thread.currentThread().setContextClassLoader(SyndFeedInput.class.getClassLoader()); } else { System.out.println("!!! ClassLoader equal"); }
You need to create a classloader containing the current classloader and the JAR file you wish to use. Try the following code:
private void setClassloader(ArrayList<StringBuffer> jarFiles) throws AspireException { ArrayList<URL> urls = new ArrayList<URL>(); // Bail if there are no files. if (jarFiles == null || jarFiles.size() == 0) return; for (StringBuffer file:jarFiles) { // Ignore empty paths if (Utilities.isEmpty(file.toString())) { warn("Ignoring blank jar file specification in component JNDI configuration"); continue; } // Get the path to the file String fullDriverPath = getFilePathFromAspireHome(file.toString().trim()); debug("Adding %s to classpath", fullDriverPath); // Check it exists File jarFile = new File(fullDriverPath); if(!jarFile.exists()) { // Jar file doesn't exist throw new AspireException("aspire.feeders.JMSFeederImpl.jar-not-found", "Unable to locate the configured Jar file. The parameter (\"%s\") is probably incorrect. Full path = \"%s\"", file, fullDriverPath); } // Construct the url for the file and add it to the list of urls try { urls.add(jarFile.toURI().toURL()); } catch (MalformedURLException e) { throw new AspireException("aspire.feeders.JMSFeederImpl.cant-parse-jar-file-name", e, "Unable to convert the Jar file to a URL. The parameter (\"%s\") is probably incorrectly specified. Full path = \"%s\"", file, fullDriverPath); } } // Create a class loader with the new jars _queueClassLoader = URLClassLoader.newInstance(urls.toArray(new URL[urls.size()]), this.getClass().getClassLoader());
You can then either load the classes directly from the loader:
// Load the class _queueClassLoader.loadClass(className);
Or set the current thread's class loader to be the one created:
// Add the jars in to the classloader for this thread if (_queueClassLoader != null) { debug("Adding jars to classloader"); Thread.currentThread().setContextClassLoader(_queueClassLoader); }
I ran into a subtle infinite loop, which I'd like to share with you.
First, my XPath for extracting components from a component-manager configuration is this:
final static AXPath componentXPath = new AXPath("/config/components/component");
And my system XML was this:
<config> <components> <component name="feeder" subType="default" factoryName="Aspire.RSSFeeder"> . . . </config> </component> <component name="pipeline" subType="pipeline" factoryName="aspire.Application"> <config> <components> <component . . . /> . . . </components> </config> </component> </components> </config>
The infinite loop was this:
What happened is that the pipeline, when it got the list of components, because of the leading "/", it actually accessed the components from the top-level of the configuration file, not the components nested within the pipeline-manager's configuration section.
And so, it started all of the components, which meant that it started another copy of itself! (When it then went again to /config/components/component, got the top-level components again, and then started another copy if itself again, etc. etc.)
When accessing your configuration elements with XPATH, do not use a leading "/". If you do, you will access the configuration of the system manager all the way at the top.
You will be passed an "element" as your configuration, which is a <config> element nested within the overall system configuration file. If you are using XPaths to extract information from the element, a leading "/" will go all the way to the top.
(Note that none of this affects how the digester works. You should still use a leading "/" for simple-digesting.)
To fix the problem, change your XPATH to simply: components/component
Note there is no need to prefix it with "config", since we are already "inside" the config node.
This will create unnecessary and destructive <parent> tags inside your stage's pom.xml file. If you discover these, just delete the entire <parent> tag.
It is best to use the archetype to create the stage directly inside your Eclipse project.
Under certain circumstances, the bundles copied to the output directoy are named with a timestamp:
13/05/2010 20:00 4,106,811 aspire-groovy-0.2-20100510.030504-6.jar 13/05/2010 20:00 95,416 aspire-rdb-0.2-20100510.030658-5.jar 13/05/2010 20:00 1,244,640 aspire-rdbfeeder-0.2-20100510.030720-3.jar 13/05/2010 20:00 172,085 aspire-tools-0.2-20100510.031037-5.jar
You can solve this by adding an outputFilenameMapping to the distribution.xml file:
<dependencySet> <outputFileNameMapping>${artifact.artifactId}-${artifact.baseVersion}.${artifact.extension}</outputFileNameMapping> <unpack>false</unpack> <useTransitiveDependencies>false</useTransitiveDependencies> <outputDirectory>bundles/aspire</outputDirectory> <includes> <include>com.searchtechnologies:*</include> </includes> </dependencySet>
Call job.terminate(). This will set a flag which causes the job to be terminated once the current stage completes. See Terminating Jobs for more details.
I got a stange class loadeder issue when loading a component
Exception in thread "Thread-11" java.lang.LinkageError: loader constraint violation: when resolving method "com.searchtechnologies.aspire.services.ServiceUtils.getComponentServiceTracker(Lorg/osgi/framework /BundleContext;Ljava/lang/String;)Lorg/osgi/util/tracker/ServiceTracker;" the class loader (instance of org/apache /felix/framework/ModuleImpl$ModuleClassLoader) of the current class, com/searchtechnologies/aspire/framework /ComponentImpl, and the class loader (instance of org/apache/felix/framework/ModuleImpl$ModuleClassLoader) for resolved class, com/searchtechnologies/aspire/services/ServiceUtils, have different Class objects for the type org/osgi/util/tracker/ServiceTracker used in the signature at com.searchtechnologies.aspire.framework.ComponentImpl.getComponentServiceTracker(ComponentImpl.java:515) at com.searchtechnologies.aspire.framework.BranchInfo.setPipelineManagerName(BranchInfo.java:70) at com.searchtechnologies.aspire.jobErrorHandler.JobErrorHandlerImpl.resubmitJob(JobErrorHandlerImpl.java:432) at com.searchtechnologies.aspire.jobErrorHandler.JobErrorHandlerImpl.resubmitJobs(JobErrorHandlerImpl.java:396) at com.searchtechnologies.aspire.jobErrorHandler.JobErrorHandlerImpl.run(JobErrorHandlerImpl.java:364) at java.lang.Thread.run(Unknown Source)
It turned out to be a missing import package. Adding org.osgi.util.tracker to the <Import-Package> of the pom solved it.
I've also seen this error:
Exception in thread "Thread-5" java.lang.LinkageError: loader constraint violation: when resolving field "NODE" the class loader (instance of org/apache/felix/framework/ModuleImpl$ModuleClassLoader) of the referring class, javax/xml/xpath/XPathConstants, and the class loader (instance of <bootloader>) for the field's resolved type, javax/xml/namespace/QName, have different Class objects for that type at com.searchtechnologies.aspire.framework.AXPath.getElement(AXPath.java:112) at com.searchtechnologies.aspire.framework.AXPath.getElement(AXPath.java:99) at com.searchtechnologies.aspire.framework.ComponentFactoryImpl.persistComponent(ComponentFactoryImpl.java:377) at com.searchtechnologies.aspire.framework.ComponentFactoryImpl.registerComponent(ComponentFactoryImpl.java:333) at com.searchtechnologies.aspire.application.ComponentManagerImpl.registerComponents(ComponentManagerImpl.java:153) at com.searchtechnologies.aspire.application.ComponentManagerImpl.initialize(ComponentManagerImpl.java:65) at com.searchtechnologies.aspire.framework.ComponentFactoryImpl.registerComponent(ComponentFactoryImpl.java:328) at com.searchtechnologies.aspire.application.AspireApplicationComponent.startManager(AspireApplicationComponent.java:177) at com.searchtechnologies.aspire.application.AspireApplicationImpl.autoStart(AspireApplicationImpl.java:82) at com.searchtechnologies.aspire.application.AspireActivator.run(AspireActivator.java:60) at java.lang.Thread.run(Unknown Source)
Again, this was a missing import. I cleaned up the pom and added javax.xml.namespace to the <Import-Package>
One error not caused by the pom:
ERROR: Unable to start system bundle. (java.lang.ClassCastException: com.sun.org.apache.xerces.internal.dom.CommentImpl cannot be cast to org.w3c.dom.Element) java.lang.ClassCastException: com.sun.org.apache.xerces.internal.dom.CommentImpl cannot be cast to org.w3c.dom.Element at com.searchtechnologies.aspire.distributed.discovery.DiscoveryManager.initialize(DiscoveryManager.java:106) at com.searchtechnologies.aspire.distributed.DistributedCommunicationsManager.initialize(DistributedCommunicationsManager.java:137) at com.searchtechnologies.aspire.application.AspireApplicationImpl.doAdditionalInitialization(AspireApplicationImpl.java:69)
This was because of a comment in an XML file being read by the following code:
Element discoveryMethodElement = (Element)discoveryMethodsFromConfig.item(index);
The only solution at the moment is to remove the comment.
See Accessing Other Components for details on how to programatically access one component from another.
Suppose you have a situation where one Aspire component needs to call another component, for example where the RDBFeeder needs to call the RDB connection pool. You need to be careful when declaring the import and export packages in the components pom file and the component dependencies.
Basically, OSGi acts as a broker between components which need to access certain java packages, and components which supply these packages to other components:
In order for OSGi to know how to “wire things up”, it needs to know (through the <Export-Package> and <import-Package> tags in the POM files, exactly what java packages (not, not classes, but entire packages) are available to other components (<Export-Package>) and which ones are need to be accessed from other components (<Import-Package>).
Also, we would strongly recommend that you separate out the methods of your classes, which need to be exported, and include these in a Java “Interface” – and then put this interface into a separate java package – to better control the interface between your packages. This will help organize your thoughts about exactly how packages should communicate to each other. See aspire-rdb and the RDBMSConnectionPool (inside the com.searchtechnologies.aspire.rdb package) for an example.
Your caller bundles should also include a dependency of your called bundle in the pom file.
Typically if you get something wrong you'll see an exception like:
Java.lang.ClassCastException: com.searchtechnologies.aspire.components.DatabaseLogger cannot be cast to com.searchtechnologies.cpatools.Logger
At the point at which you try to access the component:
//import declared import com.searchtechnologies.cpatools.Logger; //this is executed in a method within the initialize() method: String componentName = getStringFromConfig(config, "componentNameDBLogger", null); //this is executed in the process() method and throws the class cast exception Logger logger = (Logger) getComponent(componentName);
If you still have issues, other things to try:
<config name="assets"> <bundles> <filePattern>bundles/aspire/aspire-called-bundle.jar</filePattern> <filePattern>bundles/aspire/aspire-caller-bundle.jar</filePattern> </bundles> <components> . . </components> </config>
If you're using version 0.4-SNAPSHOT or later, this shouldn't be an issue as Aspire will automatically load the dependent bundle first.
When creating components, you'll often need to reference other Aspire components or third-party JAR files. In order to do this, you'll end up with one or more dependencies in the pom file, similar to this:
<dependency> <groupId>com.searchtechnologies</groupId> <artifactId>aspire-simplefeeder</artifactId> <version>0.4-SNAPSHOT</version> </dependency> <dependency> <groupId>com.searchtechnologies</groupId> <artifactId>aspire-ccd</artifactId> <version>0.4-SNAPSHOT</version> <scope>provided</scope> </dependency> <dependency> <groupId>net.java.dev.rome</groupId> <artifactId>rome</artifactId> <version>1.0.0</version> </dependency> <dependency> <groupId>commons-io</groupId> <artifactId>commons-io</artifactId> <version>1.4</version> </dependency>
You should have added these in order to make the code compile. If you didn't, and you added JAR files directly to the build path in Eclipse, then shame on you. You'll regret it later.
However, making it compile is not the same as getting it to run within Aspire.
The first thing to note is that only one of the dependencies in the fragment above has the <scope> tag. If you don't specify this, the scope defaults to compile. The scopes are described below:
compile - Include the third-party JAR in my component (i.e., the bundle JAR file will contain the third-party JAR file).
In the example above, "aspire-simplefeeder" behaves like a "third-party JAR." It will actually be included in your component just like any other JAR from someone else.
Aspire (from 0.4 onwards), when attempting to load a component, assumes any provided dependency (with exceptions such as aspire-application, aspire-services, and org.osgi) is a Full Aspire Component which must be pre-loaded before the current component can be loaded. The Aspire built-in dependency loader is only for complete Aspire components (they must be specified as <packaging>bundle</packaging> in their POM). It will not work for ordinary JAR files. This is by design. Ordinary JAR files must be included in your component (as a compile-time dependency), or they must be exported by another component.
In versions 0.3 and earlier, you need to ensure that a dependent bundle is loaded first by ensuring the file appears first in you <bundles> tag. Again, third-party JAR files must be included in your component (as a compile-time dependency).
If you experience memory issues or JVM core dumps you can enable JMX on Felix and then connect using JConsole.
Just add the following parameters to the JAVA_OPTS in the start up script:
-XX:+HeapDumpOnOutOfMemoryError -Dcom.sun.management.jmxremote -Dcom.sun.management.jmxremote.port=9999 -Dcom.sun.management.jmxremote.authenticate=false -Dcom.sun.management.jmxremote.ssl=false
Restart Aspire and then use JConsole to connect to port 9999.
The permgen is where the JVM keeps the actual classes (not the instances, just the "boiler plates"), including fields, method names, etc.
Given how OSGI works, exposing some classes as "private" within the bundle and other as "public", i.e., available to other bundles, then the classloader(s) could have different dynamic classes for what could be considered copies of the "same" class (which may or may not be true in reality, because they could be different versions of it, with even different functionality).
Hence, OSGI will tend to fill more easily this memory.
The more evident example today is the JMS Feeder; which uses reflection on its implementation, thus indirectly using more of this memory.
All in all, it is a small price to pay (increasing this memory) for all the capabilities we get from the OSGI framework.
Therefore, it is important that we monitor this memory usage. It can be checked using the JConsole.
For setting and/or changing the permgen size, just add/change the MaxPermSize (valid for JAva 1.7 and below) parameter to the JAVA_OPTS in the start up script. An example where we set the size to 256m is below:
-XX:MaxPermSize=256m
If you are using Java 1.8 try setting and/or changing the metaspace size, just add/change the -XX:MaxMetaspaceSize parameter to the JAVA_OPTS in the start up script. An example where we set the size to 256m is below:
-XX:MaxMetaspaceSize=256m
Given its nature, permgen nor metaspace is garbage collected.
In some rare cases, an error in the configuration of pipeline managers and branches can cause a deadlock. To prevent that, be sure that you are not branching sub-jobs to the same pipeline manager doing the branching.
Pipeline Manager A: XMLSubjobExtractor -> PrintToFile
Where XMLSubjobExtractor branch is configured to send jobs to Pipeline Manager A. This will almost certainly cause a deadlock.
Pipeline Manager A (parent jobs): XMLSubjobExtractor
Pipeline Manager B (sub-jobs): PrintToFile
Where XMLSubjobExtractor branch is configured so it sends jobs to Pipeline Manager B.
Notice that the same may happen wherever you have a branch configuration. So be careful when setting the destination of your branches.
org.xml.sax.SAXParseException: Content is not allowed in prolog.
This error is caused by the presence of a byte-order mark (BOM) at the start of your XML file. You can see the BOM using "od -c <filename>". You can remove the BOM simply by copying and pasting the entire file contents to a new file in a text editor since the BOM does not get copied to the clipboard.
The following error was caused by an incorrect Groovy component configuration:
AspireException(GroovyStage.script-execution-error): com.searchtechnologies.aspire.services.AspireException: Error occured while executing the groovy script. (component='/MemoryLane-initial-experiment/ProcessEndecaFiles/SplitEndecaFileAndFeedIndividualDocs', componentFactory='aspire-groovy') at com.searchtechnologies.aspire.components.GroovyStage.process(GroovyStage.java:146) at com.searchtechnologies.aspire.framework.JobHandler.runNested(JobHandler.java:106) at com.searchtechnologies.aspire.framework.JobHandler.run(JobHandler.java:47) at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908) at java.lang.Thread.run(Thread.java:619) Caused by: groovy.lang.MissingPropertyException: No such property: bh for class: Script1 at org.codehaus.groovy.runtime.ScriptBytecodeAdapter.unwrap(ScriptBytecodeAdapter.java:49) at org.codehaus.groovy.runtime.callsite.PogoGetPropertySite.getProperty(PogoGetPropertySite.java:49) at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callGroovyObjectGetProperty(AbstractCallSite.java:241) at Script1.run(Script1.groovy:23) at com.searchtechnologies.aspire.components.GroovyStage.process(GroovyStage.java:142) ... 5 more
The script used a branch handler, but its configuration had been placed outside the <config> block:
<!-- Mis placed <branches> configuration below --> <branches> <branch event="onPublish" pipelineManager="../PrintDocDontIndex" /> </branches>
<component name="SplitEndecaFileAndFeedIndividualDocs" subType="default" factoryName="aspire-groovy"> <config> <script> <![CDATA[ . . AspireDocument subDoc = new AspireDocument(); subDoc.add("text", outputBuffer.toString()); Job subJob = job.createSubJob(subDoc, job.getJobId() + "-" + subjobCount); bh.enqueue(subJob, "onPublish"); . . ]]> </script> </config> <!-- Mis placed <branches> configuration below --> <branches> <branch event="onPublish" pipelineManager="../PrintDocDontIndex" /> </branches> </component>
The correct configuration is shown below:
<!-- Correctly placed <branches> configuration below --> <branches> <branch event="onPublish" pipelineManager="../PrintDocDontIndex" /> </branches>
<component name="SplitEndecaFileAndFeedIndividualDocs" subType="default" factoryName="aspire-groovy"> <config> <script> <![CDATA[ . . AspireDocument subDoc = new AspireDocument(); subDoc.add("text", outputBuffer.toString()); Job subJob = job.createSubJob(subDoc, job.getJobId() + "-" + subjobCount); bh.enqueue(subJob, "onPublish"); . . ]]> </script> <!-- Correctly placed <branches> configuration below --> <branches> <branch event="onPublish" pipelineManager="../PrintDocDontIndex" /> </branches> </config> </component>
The trick here is changing the package name in multiple places. It is easiest to let Eclipse handle most of it (via Refactor->Rename).
Note that your JAR may still include a com.searchtechnologies section AS WELL AS your new path because Aspire includes com.searchtechnologies.aspire.framework in every distribution.
Now the project that uses the above, JAR needs to be adjusted for its new location:
1. In the Aspire config file that references your component, change the factory name; for example:
<component name="customComponent" subType="default" factoryName="com.newgroupid:aspire-custom-component"/>
2. In the POM for the enclosing project, find the dependency block for your component and change its groupId:
<dependency> <groupId>com.newgroupid</groupId> <artifactId>aspire-custom-component</artifactId> <version>0.4-SNAPSHOT</version> <scope>jar</scope> </dependency>
3. In the distribution.xml file for the enclosing project, make sure that there is a dependencySet for the new package:
<dependencySet> <outputFileNameMapping>${artifact.artifactId}-${artifact.baseVersion}${dashClassifier?}.${artifact.extension}</outputFileNameMapping> <unpack>false</unpack> <useTransitiveDependencies>false</useTransitiveDependencies> <outputDirectory>bundles/aspire</outputDirectory> <includes> <include>com.newgroupid:*</include> </includes> </dependencySet>
Things to try:
Try to avoid using metadata tags which start with "<rss" or "<feed", e.g., <feederLabel>.
Basically, IE looks for certain tags at the beginning of a random XML file to determine if it's an RSS feed. If it is, it may do something special with it.
When using an RSS 2.0, 0.91, or 0.92 feed, the HTTP Content-Type header should be “text/xml”:
Content-Type: text/xml
Because this is a generic content-type, IE7 will scan the first 512 bytes to look for the “<rss” string that indicates that it is a feed. If the string is found, the file is considered a feed, and the feed preview is shown to the user. For example, this is a RSS 2.0 feed for the IE blog, and IE7 determines that it is a feed because of the highlighted string.
<?xml version="1.0" encoding="UTF-8" ?> <rss version="2.0" ... >
IE may then report that the XML file can not be parsed as an RSS feed (or some such error).
When running under just the component test bench, everything worked fine, but when running under the application test bench, a third-party JAR is unable to locate its own properties file.
Sometimes, third-party JARs use wacky methods for loading classes. In the case of the ROME libraries, do the following under OSGi in order for the ROME JARr to locate its own properties files:
// Reset the classloader for this thread to prevent issues finding the ROME properties file // and classes if (!Thread.currentThread().getContextClassLoader().equals(SyndFeedInput.class.getClassLoader())) { System.out.println("Setting ClassLoader"); Thread.currentThread().setContextClassLoader(SyndFeedInput.class.getClassLoader()); } else System.out.println("!!! ClassLoader equal");
You need to create a classloader containing the current classloader and the JAR file you wish to use. Try the following code:
private void setClassloader(ArrayList<StringBuffer> jarFiles) throws AspireException { ArrayList<URL> urls = new ArrayList<URL>(); // Bail if there are no files. if (jarFiles == null || jarFiles.size() == 0) return; for (StringBuffer file:jarFiles) { // Ignore empty paths if (Utilities.isEmpty(file.toString())) { warn("Ignoring blank jar file specification in component JNDI configuration"); continue; } // Get the path to the file String fullDriverPath = getFilePathFromAspireHome(file.toString().trim()); debug("Adding %s to classpath", fullDriverPath); // Check it exists File jarFile = new File(fullDriverPath); if(!jarFile.exists()) { // Jar file doesn't exist throw new AspireException("aspire.feeders.JMSFeederImpl.jar-not-found", "Unable to locate the configured Jar file. The parameter (\"%s\") is probably incorrect. Full path = \"%s\"", file, fullDriverPath); } // Construct the url for the file and add it to the list of urls try { urls.add(jarFile.toURI().toURL()); } catch (MalformedURLException e) { throw new AspireException("aspire.feeders.JMSFeederImpl.cant-parse-jar-file-name", e, "Unable to convert the Jar file to a URL. The parameter (\"%s\") is probably incorrectly specified. Full path = \"%s\"", file, fullDriverPath); } } // Create a class loader with the new jars _queueClassLoader = URLClassLoader.newInstance(urls.toArray(new URL[urls.size()]), this.getClass().getClassLoader());
You can then either load the classes directly from the loader:
// Load the class _queueClassLoader.loadClass(className);
Or set the current thread's class loader to be the one created:
// Add the jars in to the classloader for this thread if (_queueClassLoader != null) { debug("Adding jars to classloader"); Thread.currentThread().setContextClassLoader(_queueClassLoader); }
I ran into a subtle infinite loop, which I'd like to share with you.
First, my XPath for extracting components from a component-manager configuration is this:
final static AXPath componentXPath = new AXPath("/config/components/component");
And my system XML was this:
<config> <components> <component name="feeder" subType="default" factoryName="Aspire.RSSFeeder"> . . . </config> </component> <component name="pipeline" subType="pipeline" factoryName="aspire.Application"> <config> <components> <component . . . /> . . . </components> </config> </component> </components> </config>
The infinite loop was this:
What happened is that the pipeline, when it got the list of components, because of the leading "/", it actually accessed the components from the top-level of the configuration file, not the components nested within the pipeline-manager's configuration section.
And so, it started all of the components, which meant that it started another copy of itself! (When it then went again to /config/components/component, got the top-level components again, and then started another copy if itself again, etc. etc.)
When accessing your configuration elements with XPATH, do not use a leading "/". If you do, you will access the configuration of the system manager all the way at the top.
You will be passed an "element" as your configuration, which is a <config> element nested within the overall system configuration file. If you are using XPaths to extract information from the element, a leading "/" will go all the way to the top.
Note that none of this affects how the digester works. You should still use a leading "/" for simple-digesting.
To fix the problem, change your XPATH to simply: components/component
Note there is no need to prefix it with "config", since we are already "inside" the config node.
This will create unnecessary and destructive <parent> tags inside your stage's pom.xml file. If you discover these, just delete the entire <parent> tag.
It is best to use the archetype to create the stage directly inside your Eclipse project.
Under certain circumstances, the bundles copied to the output directoy are named with a timestamp:
13/05/2010 20:00 4,106,811 aspire-groovy-0.2-20100510.030504-6.jar 13/05/2010 20:00 95,416 aspire-rdb-0.2-20100510.030658-5.jar 13/05/2010 20:00 1,244,640 aspire-rdbfeeder-0.2-20100510.030720-3.jar 13/05/2010 20:00 172,085 aspire-tools-0.2-20100510.031037-5.jar
You can solve this by adding an outputFilenameMapping (line 2) to the distribution.xml file:
<dependencySet> <outputFileNameMapping>${artifact.artifactId}-${artifact.baseVersion}.${artifact.extension}</outputFileNameMapping> <unpack>false</unpack> <useTransitiveDependencies>false</useTransitiveDependencies> <outputDirectory>bundles/aspire</outputDirectory> <includes> <include>com.searchtechnologies:*</include> </includes> </dependencySet>
Call job.terminate(). This will set a flag which causes the job to be terminated once the current stage completes. See Terminating Jobs for more details.
I got a stange class loadeder issue when loading a component
Exception in thread "Thread-11" java.lang.LinkageError: loader constraint violation: when resolving method "com.searchtechnologies.aspire.services.ServiceUtils.getComponentServiceTracker(Lorg/osgi/framework /BundleContext;Ljava/lang/String;)Lorg/osgi/util/tracker/ServiceTracker;" the class loader (instance of org/apache /felix/framework/ModuleImpl$ModuleClassLoader) of the current class, com/searchtechnologies/aspire/framework /ComponentImpl, and the class loader (instance of org/apache/felix/framework/ModuleImpl$ModuleClassLoader) for resolved class, com/searchtechnologies/aspire/services/ServiceUtils, have different Class objects for the type org/osgi/util/tracker/ServiceTracker used in the signature at com.searchtechnologies.aspire.framework.ComponentImpl.getComponentServiceTracker(ComponentImpl.java:515) at com.searchtechnologies.aspire.framework.BranchInfo.setPipelineManagerName(BranchInfo.java:70) at com.searchtechnologies.aspire.jobErrorHandler.JobErrorHandlerImpl.resubmitJob(JobErrorHandlerImpl.java:432) at com.searchtechnologies.aspire.jobErrorHandler.JobErrorHandlerImpl.resubmitJobs(JobErrorHandlerImpl.java:396) at com.searchtechnologies.aspire.jobErrorHandler.JobErrorHandlerImpl.run(JobErrorHandlerImpl.java:364) at java.lang.Thread.run(Unknown Source)
It turned out to be a missing import package. Adding org.osgi.util.tracker to the <Import-Package> of the pom solved it.
I've also seen this error:
Exception in thread "Thread-5" java.lang.LinkageError: loader constraint violation: when resolving field "NODE" the class loader (instance of org/apache/felix/framework/ModuleImpl$ModuleClassLoader) of the referring class, javax/xml/xpath/XPathConstants, and the class loader (instance of <bootloader>) for the field's resolved type, javax/xml/namespace/QName, have different Class objects for that type at com.searchtechnologies.aspire.framework.AXPath.getElement(AXPath.java:112) at com.searchtechnologies.aspire.framework.AXPath.getElement(AXPath.java:99) at com.searchtechnologies.aspire.framework.ComponentFactoryImpl.persistComponent(ComponentFactoryImpl.java:377) at com.searchtechnologies.aspire.framework.ComponentFactoryImpl.registerComponent(ComponentFactoryImpl.java:333) at com.searchtechnologies.aspire.application.ComponentManagerImpl.registerComponents(ComponentManagerImpl.java:153) at com.searchtechnologies.aspire.application.ComponentManagerImpl.initialize(ComponentManagerImpl.java:65) at com.searchtechnologies.aspire.framework.ComponentFactoryImpl.registerComponent(ComponentFactoryImpl.java:328) at com.searchtechnologies.aspire.application.AspireApplicationComponent.startManager(AspireApplicationComponent.java:177) at com.searchtechnologies.aspire.application.AspireApplicationImpl.autoStart(AspireApplicationImpl.java:82) at com.searchtechnologies.aspire.application.AspireActivator.run(AspireActivator.java:60) at java.lang.Thread.run(Unknown Source)
Again, this was a missing import. I cleaned up the pom and added javax.xml.namespace to the <Import-Package>
One error not caused by the pom:
ERROR: Unable to start system bundle. (java.lang.ClassCastException: com.sun.org.apache.xerces.internal.dom.CommentImpl cannot be cast to org.w3c.dom.Element) java.lang.ClassCastException: com.sun.org.apache.xerces.internal.dom.CommentImpl cannot be cast to org.w3c.dom.Element at com.searchtechnologies.aspire.distributed.discovery.DiscoveryManager.initialize(DiscoveryManager.java:106) at com.searchtechnologies.aspire.distributed.DistributedCommunicationsManager.initialize(DistributedCommunicationsManager.java:137) at com.searchtechnologies.aspire.application.AspireApplicationImpl.doAdditionalInitialization(AspireApplicationImpl.java:69)
This was because of a comment in an XML file being read by the following code:
Element discoveryMethodElement = (Element)discoveryMethodsFromConfig.item(index);
The only solution at the moment is to remove the comment.
See Accessing Other Components for details on how to programatically access one component from another.
Suppose you have a situation where one Aspire component needs to call another component, for example where the RDBFeeder needs to call the RDB connection pool. You need to be careful when declaring the import and export packages in the components pom file and the component dependencies.
Basically, OSGi acts as a broker between components which need to access certain java packages, and components which supply these packages to other components:
In order for OSGi to know how to “wire things up”, it needs to know (through the <Export-Package> and <import-Package> tags in the POM files, exactly what java packages (not, not classes, but entire packages) are available to other components (<Export-Package>) and which ones are need to be accessed from other components (<Import-Package>).
Also, we would strongly recommend that you separate out the methods of your classes, which need to be exported, and include these in a Java “Interface” – and then put this interface into a separate java package – to better control the interface between your packages. This will help organize your thoughts about exactly how packages should communicate to each other. See aspire-rdb and the RDBMSConnectionPool (inside the com.searchtechnologies.aspire.rdb package) for an example.
Your caller bundles should also include a dependency of your called bundle in the pom file.
Typically if you get something wrong you'll see an exception like:
Java.lang.ClassCastException: com.searchtechnologies.aspire.components.DatabaseLogger cannot be cast to com.searchtechnologies.cpatools.Logger
At the point at which you try to access the component:
//import declared import com.searchtechnologies.cpatools.Logger; //this is executed in a method within the initialize() method: String componentName = getStringFromConfig(config, "componentNameDBLogger", null); //this is executed in the process() method and throws the class cast exception Logger logger = (Logger) getComponent(componentName);
If you still have issues, other things to try:
<config name="assets"> <bundles> <filePattern>bundles/aspire/aspire-called-bundle.jar</filePattern> <filePattern>bundles/aspire/aspire-caller-bundle.jar</filePattern> </bundles> <components> . . </components> </config>
If you're using version 0.4-SNAPSHOT or later, this shouldn't be an issue as Aspire will automatically load the dependent bundle first.
When creating components, you'll often need to reference other Aspire components or third-party JAR files. In order to do this, you'll end up with one or more dependencies in the pom file, similar to this:
<dependency> <groupId>com.searchtechnologies</groupId> <artifactId>aspire-simplefeeder</artifactId> <version>0.4-SNAPSHOT</version> </dependency> <dependency> <groupId>com.searchtechnologies</groupId> <artifactId>aspire-ccd</artifactId> <version>0.4-SNAPSHOT</version> <scope>provided</scope> </dependency> <dependency> <groupId>net.java.dev.rome</groupId> <artifactId>rome</artifactId> <version>1.0.0</version> </dependency> <dependency> <groupId>commons-io</groupId> <artifactId>commons-io</artifactId> <version>1.4</version> </dependency>
You should have added these in order to make the code compile. If you didn't, and you added JAR files directly to the build path in Eclipse, then shame on you. You'll regret it later.
However, making it compile is not the same as getting it to run within Aspire.
The first thing to note is that only one of the dependencies in the fragment above has the <scope> tag. If you don't specify this, the scope defaults to compile. The scopes are described below:
Aspire (from 0.4 onwards), when attempting to load a component, assumes any provided dependency (with exceptions such as aspire-application, aspire-services, and org.osgi) is a Full Aspire Component which must be pre-loaded before the current component can be loaded. The Aspire built-in dependency loader is only for complete Aspire components (they must be specified as <packaging>bundle</packaging> in their POM). It will not work for ordinary JAR files. This is by design. Ordinary JAR files must be included in your component (as a compile-time dependency), or they must be exported by another component.
In versions 0.3 and earlier, you need to ensure that a dependent bundle is loaded first by ensuring the file appears first in you <bundles> tag. Again, third-party JAR files must be included in your component (as a compile-time dependency).
If you experience memory issues or JVM core dumps you can enable JMX on Felix and then connect using JConsole.
Just add the following parameters to the JAVA_OPTS in the start up script:
-XX:+HeapDumpOnOutOfMemoryError -Dcom.sun.management.jmxremote -Dcom.sun.management.jmxremote.port=9999 -Dcom.sun.management.jmxremote.authenticate=false -Dcom.sun.management.jmxremote.ssl=false
Restart Aspire and then use JConsole to connect to port 9999.
The permgen is where the JVM keeps the actual classes (not the instances, just the "boiler plates"), including fields, method names, etc.
Given how OSGI works, exposing some classes as "private" within the bundle and other as "public", i.e., available to other bundles, then the classloader(s) could have different dynamic classes for what could be considered copies of the "same" class (which may or may not be true in reality, because they could be different versions of it, with even different functionality).
Hence, OSGI will tend to fill more easily this memory.
The more evident example today is the JMS Feeder; which uses reflection on its implementation, thus indirectly using more of this memory.
All in all, it is a small price to pay (increasing this memory) for all the capabilities we get from the OSGI framework.
Therefore, it is important that we monitor this memory usage. It can be checked using the JConsole.
For setting and/or changing the permgen size, just add/change the MaxPermSize parameter to the JAVA_OPTS in the start up script. An example where we set the size to 256m is below:
-XX:MaxPermSize=256m
Given its nature, permgen is not garbage collected.
In some rare cases, an error in the configuration of pipeline managers and branches can cause a deadlock. To prevent that, be sure that you are not branching sub-jobs to the same pipeline manager doing the branching.
Pipeline Manager A: XMLSubjobExtractor -> PrintToFile
Where XMLSubjobExtractor branch is configured to send jobs to Pipeline Manager A. This will almost certainly cause a deadlock.
Pipeline Manager A (parent jobs): XMLSubjobExtractor
Pipeline Manager B (sub-jobs): PrintToFile
Where XMLSubjobExtractor branch is configured so it sends jobs to Pipeline Manager B.
Notice that the same may happen wherever you have a branch configuration. So be careful when setting the destination of your branches.
1. Go to the Aspire debug console.
2. In the first table, find and click on your Ldap Cache application.
3. On the next page, click CacheLoadScheduler.
4, Click Start.
Some connectors perform incremental crawls based on snapshot entries, which are meant to match the exact documents that have been indexed by the connector to the search engine. On an incremental crawl, the connector fully crawls the repository the same way as a full crawl, but it only indexes the modified, new or deleted documents during that crawl.
For a discussion on crawling, see Full & Incremental Crawls.
Failing to save a content source before creating or editing another content source can result in an error.
ERROR [aspire]: Exception received attempting to get execute component command com.searchtechnologies.aspire.services.AspireException: Unable to find content source
Save the initial content source before creating or working on another.
After a crawl has finished, the connector status may not be updated correctly.
To confirm this, do the following:
1. In RoboMongo, go to your connector database (like: aspire-nameOfYourConnector).
2. Open the "Status" collection and perform the following query:
db.getCollection('status').find({}).limit(1).sort({$natural:-1})
3, Edit the entry and set the status to "S" (Completed).
Note: To see the full options of "Status" values, see MongoDB Collection Status.
Make sure your connector has a manual scheduler configured for Group Expansion.
1, Go to the Aspire debug console, and look for the respective scheduler (in the fourth table: Aspire Application Scheduler).
2. If you are unsure which scheduler is for Group Expansion, you can check the Schedule Detail.
3.To run the Group Expansion process, click Run.
Before version 2.2, Aspire was developed using Java 6 and requires the Java Development Kit (JDK) (rather than just runtime libraries) to be installed. You should use the latest available release suitable for your platform from here.
Beginning with release 2.2, Aspire is developed using Java 7 and requires the Java 7 JDK to run.
And Aspire 3.3 was developed using Java 7 but now uses Java 8
Yes, since release 3.1
Check your repository configuration in settings.xml. This usually means that there are problems accessing the configured repositories, most commonly invalid credentials. (You can confirm by looking at the standard output of Aspire; you should be able to see errors fetching components from the configured repository.)
If you have configured your distribution for no Internet Access, you should rename the file config/available-applications-template.xml to config/available-applications.xml. This file will be used to populate the list of available applications to install.
Yes. We updated the script file used to support Macs.
Go to Launch Control for more details.
AIX uses IBM JDK implementation. Currently there are known compatibility issues between Aspire and IBM JDK implementation.
You should still be able to run Aspire, but be wary of unforeseen issues.