Apache Solr Language Identifier


Apache Solr Language Identifier

This module is intended to be used while indexing documents. It is implemented as an UpdateProcessor to be placed in an UpdateChain. Its purpose is to identify language from documents and tag the document with language code.

Compile bağımlılıklar (105)

Grup / Obje Sürüm Yeni Sürümü
org.htrace » htrace-core 3.0.4 NA
org.apache.pdfbox » pdfbox 1.8.8 3.0.0-alpha2
org.eclipse.jetty » jetty-webapp 9.2.10.v20150310 9.4.44.v20210927
org.eclipse.jetty » jetty-servlet 9.2.10.v20150310 10.0.12
org.eclipse.jetty » jetty-security 9.2.10.v20150310 9.4.44.v20210927
com.googlecode.mp4parser » isoparser 1.0.2 1.1.22
org.eclipse.jetty » jetty-server 9.2.10.v20150310 9.4.44.v20210927
commons-lang » commons-lang 2.6 NA
org.eclipse.jetty » jetty-continuation 9.2.10.v20150310 9.4.44.v20210927
xerces » xercesImpl 2.9.1 RELEASE
org.apache.pdfbox » jempbox 1.8.8 1.8.16
org.noggit » noggit 0.6 0.8
org.apache.pdfbox » fontbox 1.8.8 3.0.0-alpha2
org.eclipse.jetty » jetty-jmx 9.2.10.v20150310 9.4.44.v20210927
org.eclipse.jetty » jetty-io 9.2.10.v20150310 9.4.44.v20210927
org.eclipse.jetty » jetty-xml 9.2.10.v20150310 9.4.44.v20210927
org.restlet.jee » org.restlet 2.3.0 NA
org.restlet.jee » org.restlet.ext.servlet 2.3.0 NA
org.aspectj » aspectjrt 1.8.0 1.9.21.2
org.eclipse.jetty » jetty-http 9.2.10.v20150310 10.0.6
org.eclipse.jetty » jetty-util 9.2.10.v20150310 9.4.44.v20210927
com.googlecode.juniversalchardet » juniversalchardet 1.0.3 NA
org.bouncycastle » bcprov-jdk15 1.45 1.46
org.bouncycastle » bcmail-jdk15 1.45 1.46
de.l3s.boilerpipe » boilerpipe 1.1.0 NA
org.slf4j » slf4j-api 1.7.7 2.0.12
org.slf4j » slf4j-log4j12 1.7.7 2.0.12
org.slf4j » jul-to-slf4j 1.7.7 2.0.12
rome » rome 1.0 NA
org.apache.poi » poi-ooxml-schemas 3.11 4.1.2
org.apache.tika » tika-java7 1.7 1.27
org.ccil.cowan.tagsoup » tagsoup 1.2.1 NA
org.apache.poi » poi-scratchpad 3.11 5.0.0
org.apache.poi » poi 3.11 5.0.0
org.apache.poi » poi-ooxml 3.11 5.0.0
javax.servlet » javax.servlet-api 3.1.0 4.0.1
org.codehaus.woodstox » woodstox-core-asl 4.4.1 NA
org.codehaus.woodstox » stax2-api 3.1.4 4.2.1
com.spatial4j » spatial4j 0.4.1 0.5
it.unimi.dsi » fastutil 6.5.11 8.5.12
com.google.protobuf » protobuf-java 2.5.0 3.25.3
org.apache.tika » tika-core 1.7 1.27
org.apache.tika » tika-xmp 1.7 1.27
org.apache.tika » tika-parsers 1.7 1.27
org.gagravarr » vorbis-java-core 0.6 0.8
com.adobe.xmp » xmpcore 5.1.2 6.1.11
org.apache.james » apache-mime4j-dom 0.7.2 0.8.4
org.apache.james » apache-mime4j-core 0.7.2 0.8.4
org.apache.zookeeper » zookeeper 3.4.6 3.6.3
org.gagravarr » vorbis-java-tika 0.6 0.8
org.apache.lucene » lucene-backward-codecs 5.2.0 9.9.1
org.apache.lucene » lucene-codecs 5.2.0 9.9.1
net.sourceforge.jmatio » jmatio 1.0 NA
org.apache.lucene » lucene-analyzers-common 5.2.0 8.10.1
org.apache.httpcomponents » httpmime 4.4.1 4.5.12
org.apache.lucene » lucene-analyzers-kuromoji 5.2.0 8.10.1
org.apache.httpcomponents » httpclient 4.4.1 4.5.11
commons-configuration » commons-configuration 1.6 1.10
org.apache.lucene » lucene-analyzers-phonetic 5.2.0 8.10.1
org.apache.lucene » lucene-join 5.2.0 9.9.1
org.ow2.asm » asm 4.1 9.2
org.apache.lucene » lucene-memory 5.2.0 9.9.1
org.apache.lucene » lucene-misc 5.2.0 9.9.1
com.google.guava » guava 14.0.1 33.0.0-jre
org.antlr » antlr-runtime 3.5 3.5.2
org.apache.lucene » lucene-queries 5.2.0 9.9.1
org.apache.lucene » lucene-queryparser 5.2.0 9.9.1
commons-collections » commons-collections 3.2.1 3.2.2
org.apache.httpcomponents » httpcore 4.4.1 4.4.15
org.apache.lucene » lucene-core 5.2.0 9.9.1
org.apache.lucene » lucene-expressions 5.2.0 9.9.1
org.apache.lucene » lucene-grouping 5.2.0 9.9.1
joda-time » joda-time 2.2 2.12.7
org.apache.lucene » lucene-highlighter 5.2.0 9.9.1
org.apache.solr » solr-core 5.2.0 9.6.0
commons-fileupload » commons-fileupload 1.2.1 1.4
net.agkn » hll 1.6.0 NA
com.googlecode.concurrentlinkedhashmap » concurrentlinkedhashmap-lru 1.2 1.4.2
org.apache.lucene » lucene-spatial 5.2.0 7.7.3
org.eclipse.jetty » jetty-rewrite 9.2.10.v20150310 10.0.11
org.apache.lucene » lucene-suggest 5.2.0 9.9.1
org.eclipse.jetty » jetty-deploy 9.2.10.v20150310 10.0.6
org.ow2.asm » asm-commons 4.1 9.2
com.drewnoakes » metadata-extractor 2.6.2 2.16.0
org.apache.solr » solr-solrj 5.2.0 9.6.0
com.cybozu.labs » langdetect 1.1-20120112 NA
org.tukaani » xz 1.5 1.9
jdom » jdom 1.0 1.1
com.pff » java-libpst 0.8.1 0.9.3
org.apache.hadoop » hadoop-auth 2.6.0 3.3.1
commons-io » commons-io 2.4 2.11.0
org.apache.hadoop » hadoop-common 2.6.0 3.3.1
com.carrotsearch » hppc 0.5.2 0.9.0
org.apache.hadoop » hadoop-annotations 2.6.0 3.3.1
org.apache.commons » commons-compress 1.8.1 1.21
dom4j » dom4j 1.6.1 1.4-dev-8
org.apache.hadoop » hadoop-hdfs 2.6.0 3.3.1
org.eclipse.jetty » jetty-servlets 9.2.10.v20150310 10.0.11
com.ibm.icu » icu4j 54.1 73.1
com.tdunning » t-digest 3.1 3.3
commons-cli » commons-cli 1.2 1.4
log4j » log4j 1.2.17 NA
org.apache.xmlbeans » xmlbeans 2.6.0 5.0.1
net.arnx » jsonic 1.2.7 1.3.10
commons-codec » commons-codec 1.10 1.15