Nutch, an excellent web crawler for indexing with Lucene
ccharlebois
4-22-2007 9:22 PM
631 views
tags: nutch, lucence, apache, opensource, search, jakarta
clipped from lucene.apache.org (http://lucene.apache.org/nutch/about.html)

About Nutch

Overview

Nutch is open source web-search software. It builds on Lucene Java (http://lucene.apache.org/java/), adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.

For more information about Nutch, please see the Nutch wiki (http://wiki.apache.org/nutch/).