Lucene/Solr 3.5.0 released
27 November 2011
Substantially reduced (3-5X) the RAM required to hold the terms index when opening an IndexReader. (LUCENE-2205)
Added SearcherManager to manage sharing and reopening IndexSearchers across multiple search threads. Underlying IndexReader instances are safely closed once they are no longer referenced. (LUCENE-3445, LUCENE-3558)
Added SearcherLifetimeManager which safely provides a consistent view of the index across multiple requests (e.g. paging/drilldown). (LUCENE-3558, LUCENE-3486)
Added NGramPhraseQuery, which speeds up phrase queries by 30-50% when n-gram analysis is used. (LUCENE-3426)
Improvements to vector highlighting: support for more queries, such as wildcards, and boundary analysis for generated snippets. (LUCENE-1824, LUCENE-1889)
Highlights of the Solr release include:
Bug fixes and improvements from Apache Lucene 3.5.0, including a substantial (3-5X) reduction in the RAM required to hold the terms index when opening an IndexReader. (LUCENE-2205)
Added support for distributed result grouping. (SOLR-2066, SOLR-2776)
Added a Hunspell stemmer TokenFilter, which supports stemming for 99 languages. (SOLR-2769)
A new contrib module, "langid", adds language identification capabilities as an Update Processor, using Tika's LanguageIdentifier or the Cybozu language-detection library. (SOLR-1979)
Numeric types, including Trie and date types, now support sortMissingFirst/Last. (SOLR-2881)
Added the optional hl.q parameter. When specified, it overrides the q parameter in the Highlighter. (SOLR-1926)
Several minor bug fixes, such as date parsing for years from 0001-1000 and ignored configurations when using QueryAnalyzer with SpellCheckComponent, and many more. See the CHANGES.txt entries for full details.
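To make the hl.q note above concrete, here is a minimal sketch of how a client might build a request that searches with one query but highlights with another. The host, port, handler path, and the "content" field name are assumptions for illustration only; q, hl, hl.fl, and hl.q are the standard Solr parameter names.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def build_highlight_url(base_url, query, highlight_query=None, fields="content"):
    """Build a Solr select URL with highlighting enabled.

    base_url and fields are hypothetical values chosen for this sketch.
    """
    params = {"q": query, "hl": "true", "hl.fl": fields}
    # hl.q is optional: when present, the highlighter uses it instead of q.
    if highlight_query is not None:
        params["hl.q"] = highlight_query
    return base_url + "?" + urlencode(params)


# Search on a structured query, but highlight only the term "lucene".
url = build_highlight_url(
    "http://localhost:8983/solr/select",  # assumed local Solr instance
    "title:lucene AND year:2011",
    highlight_query="lucene",
)
print(url)
```

Leaving highlight_query unset produces a request without hl.q, in which case highlighting falls back to the main q parameter.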
Go Rogue with Enterprise Search
Implementing corporate search as a skunkworks project, fixing high-profile problems for influential departments. Pretty clever.
Enterprise Search Summit - Day 3 · on Storify
Search-Based Applications with Jeff Fried of BAInsight; presentations from ConceptSearch and SmartLogic; Dynamic Search Interfaces with Shaun Ryan of SLI Systems; and Video Search at PBS.org with Tom Crenshaw and with Nate Treolar of RAMP.
Enterprise Search Summit Fall 2011 - Day 2 · Storify
Tweets and posts from ESS11 day 2.
Enterprise Search Summit Fall 2011 - on Storify
Blog posts and tweets from the conference on practical implementations of enterprise search, from mobile interfaces to search analytics to case studies.
Haystack Blog » CIKM 2011 Keynote: User Interfaces that Entice People to Manage Better Information
brynary/webrat - browser
Browser simulator for expressive, high-level acceptance testing (see Webrat::Session).
HtmlUnit - Java browser for testing
A Java tool for testing web user interactions, acting as a browser and recording results. This is great for identifying UI bugs, regression testing, and collecting metrics for comparison.
Selenium - Web Browser Automation
Selenium provides a programmatic tool for testing web user interactions, acting as a browser and recording results. This is great for identifying UI bugs, regression testing, and collecting metrics for comparison.
Documill (automatic thumbnails for search results)
Intriguing idea, generating thumbnails to show document layout and colors for better scanning and choosing search results.
Enterprise Search Summit Fall 2011
Great practical advice at the workshops; the conference and vendor exhibition start tomorrow.
Solr Enterprise Search (solr.pl)
Lively blog by Solr implementers in Poland (in English).
Not coming clean about Enterprise Search < Real Story Group Blog
Underlying issue with search: until organizations start to manage unstructured data with the same care they do structured data, we will continue to have a problem.
Enterprise Search Europe 2011 – conference report
Themes included low search satisfaction, unified information access, SBA (search-based applications), and general information management issues.
Google Search URL Parameters – Query String Anatomy
Mainly for web search, but applies to the GSA as well.
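As a rough illustration of what "query string anatomy" means in practice, the sketch below splits a search URL into its parameters using only the Python standard library. The example URL and its values are made up for this note; q, num, start, and hl are commonly seen Google search parameters, but which ones a given GSA deployment honors is not something this sketch assumes.

```python
from urllib.parse import urlsplit, parse_qs


def parse_search_url(url):
    """Return the query-string parameters of a URL as a flat dict.

    Only the first value of each parameter is kept, which is enough
    for simple search URLs where no key is repeated.
    """
    query = urlsplit(url).query
    return {key: values[0] for key, values in parse_qs(query).items()}


# Hypothetical search URL used purely as an example.
params = parse_search_url(
    "https://www.google.com/search?q=enterprise+search&num=20&start=40&hl=en"
)
for name, value in params.items():
    print(f"{name} = {value}")
```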
'Natural' Search User Interfaces | by Marti Hearst | November 2011 | Communications of the ACM
Eminent academic in the field looks at trends for future search interfaces.
Daniel Soar reviews 'The Googlisation of Everything (and Why We Should Worry)', 'In the Plex' and 'I'm Feeling Lucky'
Post-Rank Reordering: Resolving Preference Misalignments between Search Engines and End Users - Microsoft Research