Skip to content
Snippets Groups Projects
  1. Apr 15, 2006
  2. Apr 03, 2006
  3. Feb 27, 2006
  4. Feb 01, 2006
  5. Jan 29, 2006
  6. Jan 20, 2006
  7. Dec 14, 2005
  8. Dec 09, 2005
  9. Nov 04, 2005
  10. Nov 03, 2005
  11. Oct 07, 2005
  12. Sep 02, 2005
  13. Aug 16, 2005
  14. Aug 05, 2005
  15. Jun 19, 2005
  16. May 06, 2005
  17. May 05, 2005
  18. May 04, 2005
  19. Apr 30, 2005
  20. Mar 27, 2005
  21. Mar 18, 2005
  22. Jan 07, 2005
  23. Nov 27, 2004
  24. Oct 31, 2004
    • Dries Buytaert's avatar
      - Patch #12232 by Steven/UnConed: search module improvements. · 8daed9cb
      Dries Buytaert authored
      1) Clean up the text analyser: make it handle UTF-8 and all sorts of characters. The word splitter now does intelligent splitting into words and supports all Unicode characters. It has smart handling of acronyms, URLs, dates, ...
      
      2) It now indexes the filtered output, which means it can take advantage of HTML tags. Meaningful tags (headers, strong, em, ...) are analysed and used to boost certain words scores. This has the side-effect of allowing the indexing of PHP nodes.
      
      3) Link analyser for node links. The HTML analyser also checks for links. If they point to a node on the current site (handles path aliases) then the link's words are counted as part of the target node. This helps bring out commonly linked FAQs and answers to the top of the results.
      
      4) Index comments along with the node. This means that the search can make a difference between a single node/comment about 'X' and a whole thread about 'X'. It also makes the search results much shorter and more relevant (before this patch, comments were even shown first).
      
      5) We now keep track of total counts as well as a per item count for a word. This allows us to divide the word score by the total before adding up the scores for different words, and automatically makes noisewords have less influence than rare words. This dramatically improves the relevancy of multiword searches. This also makes the disadvantage of now using OR searching instead of AND searching less problematic.
      
      6) Includes support for text preprocessors through a hook. This is required to index Chinese and Japanese, because these languages do not use spaces between words. An external utility can be used to split these into words through a simple wrapper module. Other uses could be spell checking (although it would have no UI).
      
      7) Indexing is now regulated: only a certain amount of items will be indexed per cron run. This prevents PHP from running out of memory or timing out. This also makes the reindexing required for this patch automatic. I also added an index coverage estimate to the search admin screen.
      
      8) Code cleanup! Moved all the search stuff from common.inc into search.module, rewired some hooks and simplified the functions used. The search form and results now also use valid XHTML and form_ functions. The search admin was moved from search/configure to admin/search for consistency.
      
      9) Improved search output: we also show much more info per item: date, author, node type, amount of comments and a cool dynamic excerpt à la Google. The search form is now much more simpler and the help is only displayed as tips when no search results are found.
      
      10) By moving all search logic to SQL, I was able to add a pager to the search results. This improves usability and performance dramatically.
      8daed9cb
  25. Sep 18, 2004
  26. Sep 17, 2004
  27. Sep 15, 2004
    • Dries Buytaert's avatar
      · e31fef12
      Dries Buytaert authored
      - Patch #9988 by MegaGrunt: visual improvements to the tabs in Pushbutton.
      e31fef12
  28. Sep 09, 2004
  29. Aug 20, 2004
Loading