jGuru
Lucene FAQ Home Page

FAQ Manager is guru Erik Hatcher PREMIUM.

Lucene is an open-source, high-performance, full-featured text search engine written entirely in Java. It is a technology suitable for nearly any application that requires full-text search.

What's New

How can I index PowerPoint documents?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Nov 18, 2004
To index PowerPoint documents, you first need to parse them and extract the text that you want to index. You can use the Jakarta Apache POI,...
Can Lucene do a "search within search", so that the second search is constrained by the results of the first query?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Jun 4, 2004
Yes. There are two primary options: Use QueryFilter with the previous query as the filter. (you can search the mailing list archives for QueryFilter...
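The filter option can be pictured as a bit-set intersection: the matches of the first query become a set of permitted document ids, and the second query keeps only the hits that fall inside that set. The sketch below is plain Java, not Lucene code. The class `SearchWithinSearch`, the small in-memory `DOCS` array, and the `matches` helper are hypothetical stand-ins for an index and a query, chosen only to show the idea.

```java
import java.util.BitSet;

public class SearchWithinSearch {
    // Hypothetical in-memory "index": the document id is the array position.
    static final String[] DOCS = {
        "lucene index writer",
        "lucene query filter",
        "apache poi powerpoint",
        "query parser syntax"
    };

    // Stand-in for running a query: sets one bit per document that contains the word.
    public static BitSet matches(String word) {
        BitSet bits = new BitSet(DOCS.length);
        for (int id = 0; id < DOCS.length; id++) {
            for (String token : DOCS[id].split(" ")) {
                if (token.equals(word)) {
                    bits.set(id);
                }
            }
        }
        return bits;
    }

    // Second search constrained by the first: keep only hits the filter allows.
    public static BitSet searchWithin(BitSet firstResults, String secondWord) {
        BitSet hits = matches(secondWord);
        hits.and(firstResults);
        return hits;
    }

    public static void main(String[] args) {
        BitSet first = matches("lucene");              // documents 0 and 1
        BitSet refined = searchWithin(first, "query"); // document 1 only
        System.out.println(refined);
    }
}
```

Document 3 also contains "query", but it is dropped because it was not among the results of the first search.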
Can I use Lucene to index text in Chinese, Japanese, Korean, and other multi-byte character sets?
Tools:Search:Lucene:Analysis
Otis Gospodnetic PREMIUM, May 6, 2004
Yes, you can. Lucene is not limited to English, nor any other language. To index text properly, you need to use an Analyzer appropriate for the language...

Where does the name Lucene come from?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, May 5, 2004
Lucene is Doug Cutting's wife's middle name, and her maternal grandmother's first name.
Can I cache search results with Lucene?
Tools:Search:Lucene, Tools:Search:Lucene:Filtering
Otis Gospodnetic PREMIUM, Mar 29, 2004
Lucene does come with a simple cache mechanism, if you use Lucene Filters. The classes to look at are CachingWrapperFilter and QueryFilter.
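The general idea behind a caching filter can be sketched in plain Java: wrap an expensive filter and remember the set of permitted documents it produced for each index reader. This is an illustration only, not the source of either Lucene class. The names `DocFilter`, `CachingDocFilter`, and `expensiveFilter` are hypothetical, and a plain `Object` stands in for an open index reader.

```java
import java.util.BitSet;
import java.util.Map;
import java.util.WeakHashMap;

public class CachingFilterSketch {
    // Hypothetical stand-in for a filter: maps an index "reader" to the permitted doc ids.
    public interface DocFilter {
        BitSet bits(Object reader);
    }

    // Wraps another filter and remembers its result per reader.
    public static final class CachingDocFilter implements DocFilter {
        private final DocFilter delegate;
        // Weak keys let an entry be collected once its reader is no longer referenced.
        private final Map<Object, BitSet> cache = new WeakHashMap<>();

        public CachingDocFilter(DocFilter delegate) {
            this.delegate = delegate;
        }

        @Override
        public synchronized BitSet bits(Object reader) {
            BitSet cached = cache.get(reader);
            if (cached == null) {
                cached = delegate.bits(reader); // computed only on a cache miss
                cache.put(reader, cached);
            }
            return cached;
        }
    }

    // Counts how often the expensive filter actually runs.
    public static int computations = 0;

    public static DocFilter expensiveFilter() {
        return reader -> {
            computations++;
            BitSet bits = new BitSet();
            bits.set(0);
            bits.set(2);
            return bits;
        };
    }

    public static void main(String[] args) {
        Object reader = new Object(); // stands in for an open index reader
        DocFilter filter = new CachingDocFilter(expensiveFilter());
        filter.bits(reader);
        filter.bits(reader); // served from the cache
        System.out.println("computed " + computations + " time(s)");
    }
}
```

Keying the cache by reader means a cached result is reused only while the same view of the index is open; a newly opened reader triggers a fresh computation.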
Why can't I use Lucene with IBM JDK 1.3.1?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Mar 8, 2004
Apparently there is a bug in IBM's JIT code in JDK 1.3.1. To work around it, disable JIT for the org.apache.lucene.store.OutputStream.writeInt method...
What is the difference between IndexWriter.addIndexes(IndexReader[]) and IndexWriter.addIndexes(Directory[]), besides them taking different arguments?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Nov 18, 2003
When merging lots of indexes (more than the mergeFactor), the Directory-based method will use fewer file handles and less memory, as it will only ever...
If I use a compound file-style index, do I still need to optimize my index?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Nov 13, 2003
Yes. Each .cfs file created in the compound file-style index represents a single segment, which means you can still merge multiple segments into a single...
If I have two indexes and use the MultiSearcher will it be faster than only one index with all my documents?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Nov 13, 2003
That depends on the environment where MultiSearcher is used. If you have a single computer with a single CPU, then it may actually be a bit slower....
What version of the JDK is required to use Lucene?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Oct 8, 2003
Lucene requires at least version 1.2 of the JDK. Versions 1.3 and 1.4 of the JDK are also supported.
Can I search an index while it is being optimized?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Sep 2, 2003
Yes, an index can be searched and optimized simultaneously.
How can I index JSP files?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Apr 8, 2003
To index the content of JSPs that a user would see using a Web browser, you would need to write an application that acts as a Web client, in order to mimic...
How can I index PDF documents?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Apr 7, 2003
In order to index PDF documents you need to first parse them to extract text that you want to index from them. Here are some PDF parsers that can help...
How can I index XML documents?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Apr 7, 2003
In order to index XML documents you need to first parse them to extract text that you want to index from them. Here are some XML parsers that can help...
How can I index Word documents?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Apr 7, 2003
In order to index Word documents you need to first parse them to extract text that you want to index from them. Here are some Word parsers that can help...
How can I index Excel documents?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Apr 7, 2003
In order to index Excel documents you need to first parse them to extract text that you want to index from them. Here are some Excel parsers that can...
How can I index RTF documents?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Apr 7, 2003
In order to index RTF documents you need to first parse them to extract text that you want to index from them. Here are some RTF parsers that can help...
How can I index HTML documents?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Apr 7, 2003
In order to index HTML documents you need to first parse them to extract text that you want to index from them. Here are some HTML parsers that can help...
Is it safe to search an index while it is being optimized?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Jan 6, 2003
Yes, it is safe.
How do I retrieve all the values of a particular field that exists within an index, across all documents?
Tools:Search:Lucene
Otis Gospodnetic PREMIUM, Dec 31, 2002
The trick is to enumerate terms with that field. Terms are sorted first by field, then by text, so all terms with a given field are adjacent in e...
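Because terms are ordered by field and then by text, all values of one field form a contiguous run: seek to the first term of the field and read until the field changes. The sketch below shows that scan in plain Java over a sorted set. It does not use Lucene's API; the `Term` class here and the `valuesOf` helper are hypothetical stand-ins for an index's term dictionary.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

public class FieldTermEnumeration {
    // Hypothetical stand-in for an index term: a (field, text) pair.
    public static final class Term implements Comparable<Term> {
        final String field;
        final String text;

        public Term(String field, String text) {
            this.field = field;
            this.text = text;
        }

        // Order by field first, then by text, as the answer describes.
        @Override
        public int compareTo(Term other) {
            int byField = field.compareTo(other.field);
            return byField != 0 ? byField : text.compareTo(other.text);
        }
    }

    // Collects every text value of one field by scanning its contiguous run of terms.
    public static List<String> valuesOf(TreeSet<Term> terms, String field) {
        List<String> values = new ArrayList<>();
        // The empty text sorts before any other text, so this seeks to the field's first term.
        for (Term term : terms.tailSet(new Term(field, ""))) {
            if (!term.field.equals(field)) {
                break; // left the run for this field; nothing later can match
            }
            values.add(term.text);
        }
        return values;
    }

    public static void main(String[] args) {
        TreeSet<Term> terms = new TreeSet<>();
        terms.add(new Term("title", "lucene"));
        terms.add(new Term("author", "hatcher"));
        terms.add(new Term("title", "search"));
        terms.add(new Term("author", "cutting"));
        System.out.println(valuesOf(terms, "author"));
    }
}
```

The early `break` is what makes the ordering useful: the scan touches only the terms of the requested field plus one more, rather than the whole dictionary.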