HTTrack
The Web Mirror Utility

Version en français


Overview (version 1.10, 06/15/98) 1.10b RELEASE is available

HTTrack is an easy-use web site mirror utility. It allows you to transfer a World Wide Web site from the Internet to a local directory, building recursively all structures, getting html, images, and other files from the server to your computer. Links are rebuilt relatively so that you can freely browse to the local site (works with any browser). You can mirror several sites together so that you can jump from one to another. You can, also, update an existing mirror site, or continue an interrupted transfer. The robot is fully configurable, with an integrated help.

WinHTTrack is the Windows9x release of HTTrack. It is available on the ZIP archive of HTTrack.

SUN SOLARIS AND IBM AIX ONLY:
On Sun Solaris and AIX, HTTrack is composed by two programs : the graphic shell, and the robot.
The Shell is an easy way to control the robot, through a graphic interface ; it is available at the HTTrack shell page.
Here you can find the robot, which can be used as a command-line programm.

Do not hesitate to email us for comments.

Features

MouseWare!

This program is FREE, but if you like it, feel free to encourage us: send a mouse! (PC type, PS2 type port), or the necessary fee to buy one (example: $10, £5, FF50 - even $1 can help us!)
It is a little bit stange for a shareware, we know, but we really do need mouses to work properly.. 
If you make an online mirror with HTTrack, please send us a mail, too.

Send mouses (or donation) to:

Xavier Roche & Yann Philippot / HTTrack
ISMRA - ENSI of CAEN
6, boulevard du Marechal Juin
14050 CAEN Cedex
FRANCE

You will be informed (if you want) by email of the new releases of HTTrack.

Usage

The documentation is now available for WinHTTrack and HTTrack.

On SUN/Solaris and IBM/AIX, the simplest way is to use the graphic shell, but you can also use the robot in a console window.
Type httrack (without any parameters) to show the options list. Parameters and adresses must not be in special order.
There are special commands, as the Nx option.

Example:
httrack www.myweb.abc/mydir/index.hml
httrack www.myweb.abc/mydir/index.hml  www.otherweb.abc/~friend/cool/
httrack www.myweb.abc/mydir/index.html www.otherweb.abc/~friend/cool/ -N1 -P proxy.myweb.abc:1234
httrack www.myweb.abc/mydir/index.html +www.otherweb* +*.def -*.com

The first example will transfer the site starting from 'www.myweb.abc/mydir/index.hml' (and, of course, not all www.myweb.abc!)
The second example will mirror 'www.myweb.abc/mydir/index.hml' and 'www.otherweb.abc/~friend/cool/" together. (the third will generate html and images in two directory (option N1), and use a proxy (option P))
The 4th example shows how to use wildcards to accept/refuse all URLs of a certain type. Note that if you specify an URL without any wildcards (*) after '+' or '-' the URL will be accepted/refused, if it exists somewhere. 

Default options are fixed so that you can easily use the command line-mode robot.

Download version 1.10 (06/15/98)

You have taken note this program is not guaranted, and is given "as is". This is the 1.x release of HTTrack.
You can find HTTrack on many ftps around the world (name: httrkXXX.zip, XXX=release number)
(example: simtel.net mirrors)

Ready to use binaries are given. If you work under Unix/Linux, be sure they are executable (x)

Plateform

Click on the proper file to download

Current version

Windows95/98

httrack.zip

1.10b <<< WinHTTrack is included now!

SUN Solaris (5.6)

httrack.tar.gz

1.10b

IBM AIX (4.0)

httrack.tar.gz

1.10b

Linux PC

httrack.tar.gz

1.10b

Shell

The shell release is already included in the Windows version.
For Sun Solaris and AIX, please refer to the HTTrack shell page

Comments

If you like this utility, feel free to encourage us
Comments, problems and bug report are welcome, for the shell and for the robot.

Updates and bugs fixed

1.10b
+ Fixed: Better memory management
+ New: "Transfer rate out" option added (abandon slowests sites)
+ New: "Deaf" hosts do not freeze HTTrack any more
+ Fixed: Again problems with code/codebase tags
+ New: Broken links detection improved

1.04
+ Fixed:Some links were not correctly read (pages with "codebase" tags)
+ Shell: Interface improved

1.03 (No changes for the command-line robot)
+ Shell: Big bug fixed! (VERY slow transfer rates..)

1.02
+ Fixed: Some java files were not correctly transfered
+ New: Speed has been improved
+ Fixed: Log file more accurate
+ Shell: Interface has been improved

1.01
+ Fixed: Structure check error in some cases

1.00 -- The 1.00, Yeah!
+ New: base and codebase are now scanned

0.998 beta-2
+ Fixed: Multiple name bug (files having the same name in the same directory) with -O option fixed

0.997 beta-2
+ Fixed: Filenames with '%' were not correctly named
+ Fixed: Bug detected in 0.996: several files are not written on disk!!

0.996 beta-2
+ New: -O option (path for mirror and log)
+ New: Unmodified file time/date are not changed during an update

0.99 beta-2
+ New: User-agent field
+ New: Shortcuts (--spider etc.)
+ New: Links not retrieved are now rebuilt absolutly
+ New: The 'g' option (juste get files in current directory) has been added
+ New: Primary links analyste has been improved
+ Fixed: "304" bug fixed

0.25 beta-2
+ Fixed: Freeze during several mirrors fixed!
+ New: More 'N' options (filenames type)

0.24 beta-2
+ Fixed: Restart/Update with cache did not work (really not..)
+ Fixed: Jokers now work properly (e.g. +www.abc.com* do works)
+ New: The 'n' option (get non-html files near a link) has been added!

0.23 beta-2
+ Fixed: The 'M' option (site size) did not work
+ Fixed: Files larger than 65Kb were not correctly written

older beta
+ Many, many bugs fixed

Credits

Graphic shell developped by Yann Philippot

Robot developped by Xavier Roche

Project Leaded by Patrick Ducrot and Daniel Carré

Project developped at the ENSI Caen - ISMRa

(C)1998 Xavier Roche & Yann Philippot