Apache Nutch is an extensible and scalable web crawler
apache/nutch-webapp
For the latest information about Nutch, please visit our website at:
and our wiki, at:
https://cwiki.apache.org/confluence/display/NUTCH/Home
The Nutch WebApp is built using the Apache Wicket Java web framework and Spring.
N.B. Currently, you must have a running Nutch REST Server on the same host.
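Because the WebApp depends on a REST Server on the same host, it can help to confirm that something is answering before starting it. The following is a minimal sketch, not part of the project: `rest_server_up` is a hypothetical helper name, and it assumes `curl` is installed.

```shell
#!/bin/sh
# Sketch: succeed only if an HTTP request to the given URL gets a
# successful response within 5 seconds.
# rest_server_up is a hypothetical helper, not shipped with nutch-webapp.
rest_server_up() {
  url="$1"
  # --fail makes curl return non-zero on HTTP errors (4xx/5xx);
  # connection failures and timeouts also return non-zero.
  curl --silent --fail --max-time 5 --output /dev/null "$url"
}
```

For example, `rest_server_up "http://localhost:8081/admin" || echo "REST Server not reachable"`. The port `8081` and the `/admin` path are assumptions here; adjust them to match how your REST Server was started.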
You can easily run the WebApp by executing the following:
% mvn jetty:run
If you want to run the WebApp in a Jakarta Servlet container, e.g. Apache Tomcat, then run the following:
% mvn clean install -DskipTests
% cp target/nutch-webapp-1.0-SNAPSHOT.war $CATALINA_HOME/webapps
You can then access the WebApp on the Tomcat host on port 8080.
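The copy step can be wrapped with a couple of sanity checks. This is only a sketch under stated assumptions: `deploy_war` and `webapp_url` are hypothetical helper names, the WAR has already been built with Maven, and Tomcat is using its default behaviour of naming the context path after the WAR file.

```shell
#!/bin/sh
# Sketch: copy an already-built WAR into Tomcat's webapps directory.
# deploy_war and webapp_url are hypothetical helpers, not part of the project.

deploy_war() {
  war="$1"            # e.g. target/nutch-webapp-1.0-SNAPSHOT.war
  catalina_home="$2"  # e.g. "$CATALINA_HOME"
  if [ ! -f "$war" ]; then
    echo "WAR not found: $war (run 'mvn clean install -DskipTests' first)" >&2
    return 1
  fi
  if [ ! -d "$catalina_home/webapps" ]; then
    echo "No webapps directory under: $catalina_home" >&2
    return 1
  fi
  cp "$war" "$catalina_home/webapps/"
}

webapp_url() {
  host="$1"
  war="$2"
  # Assumption: the context path equals the WAR file name without ".war",
  # and Tomcat listens on port 8080 as stated above.
  context=$(basename "$war" .war)
  echo "http://$host:8080/$context/"
}
```

A typical call would be `deploy_war target/nutch-webapp-1.0-SNAPSHOT.war "$CATALINA_HOME"`, followed by `webapp_url localhost target/nutch-webapp-1.0-SNAPSHOT.war` to print the address to try in a browser. If your Tomcat is configured with a different port or context path, the printed URL will not apply.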
To contribute a patch, follow these instructions (note that installing Hub is not strictly required, but is recommended).
0. Download and install hub.github.com
1. File a JIRA issue for your fix at https://issues.apache.org/jira/projects/NUTCH/issues - you will get an issue id NUTCH-xxx, where xxx is the issue number.
2. git clone https://github.com/apache/nutch-webapp.git
3. cd nutch-webapp
4. git checkout -b NUTCH-xxx
5. edit files (please try to include a test case if possible)
6. git status (make sure it shows the files you expected to edit)
7. Make sure that your code complies with the [Nutch codeformatting template](https://raw.githubusercontent.com/apache/nutch/master/eclipse-codeformat.xml), which is basically two space indents
8. git add <files>
9. git commit -m "fix for NUTCH-xxx contributed by <your username>"
10. git fork
11. git push -u <your git username> NUTCH-xxx
12. git pull-request
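The branch name and commit message in the steps above both follow from the JIRA issue id, so they can be derived rather than typed by hand. The sketch below is illustrative only: `valid_issue_id` and `commit_message` are hypothetical helper names, and the check simply assumes an id of the form `NUTCH-` followed by digits.

```shell
#!/bin/sh
# Sketch: derive the commit message from a JIRA issue id.
# valid_issue_id and commit_message are hypothetical helpers.

valid_issue_id() {
  case "$1" in
    NUTCH-*[!0-9]*|NUTCH-) return 1 ;;  # non-digit after the prefix, or no number
    NUTCH-*) return 0 ;;                # NUTCH- followed by digits only
    *) return 1 ;;                      # anything else
  esac
}

commit_message() {
  issue="$1"  # e.g. NUTCH-123
  user="$2"   # your username
  if ! valid_issue_id "$issue"; then
    echo "not a JIRA issue id: $issue" >&2
    return 1
  fi
  echo "fix for $issue contributed by $user"
}

# Usage inside a clone of the repository (steps 4, 8 and 9):
#   git checkout -b "$issue"
#   git add <files>
#   git commit -m "$(commit_message "$issue" "$user")"
```

Validating the id first avoids creating a branch or commit that still carries the literal placeholder `NUTCH-xxx`.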
Generate Eclipse project files
mvn eclipse:eclipse
and follow the instructions in Importing existing projects.
IntelliJ IDEA users can also import Eclipse projects using the "Eclipser" plugin (https://plugins.jetbrains.com/plugin/7153-eclipser); see also Importing Eclipse Projects into IntelliJ IDEA.