How to use site sitesucker
- HOW TO USE SITE SITESUCKER MANUAL
- HOW TO USE SITE SITESUCKER OFFLINE
- HOW TO USE SITE SITESUCKER DOWNLOAD
- HOW TO USE SITE SITESUCKER MAC
- HOW TO USE SITE SITESUCKER WINDOWS
We want to enforce a maximum level of 1 now. In SiteSucker, go back into the settings and check the Limits. Now, we have a very basic html page containing all the links from the old site. Wrap it in with basic html & body tags and change the extension to.
In your favorite editor search/replace the old domain with the test (or new) domain. Go ahead and take the extra time to wrap them up in a nice anchor tag (this will help in the next step).Ĭopy this file and rename it to reflect the new site where you want to test your redirects. Take the finished log output and snip away the unneeded text to the left and right of the url. You’re off to the races!įor ~20k links, it took almost 20 minutes for SiteSucker to grab them all. Then, enter the original sitename in the Web URL input and hit enter.
HOW TO USE SITE SITESUCKER DOWNLOAD
What I wondered was how I could execute a web crawl not from a site, but from a saved file? Turns out, it’s very easy!įirst, ensure the settings of SiteSucker to log the download history (and save that log):
HOW TO USE SITE SITESUCKER MAC
It’s extremely fast and the user interface is very lean (making use of Mac OS’s Console logging application). I’d used SiteSucker a few times in recent months to double check the health of our site’s link structure. And, of course, I knew we didn’t have that many valid pages in our site. I calculated at the rate I needed to go through the top 1000, I’d need almost 7 years to double check the rest. This was horribly redundant, mind-numbing work but I didn’t see a way to export all 5 MILLION URLs that Google Analytics had on on record for the last month. In fact, SiteSucker was able to confirm that, besides images, we did manage a 100% conversion for all the existing urls.Īt first, I spent about a day going through the top 1000 URLs according to the Google Analytics tracker. Ultimately, I had to make a compromise between feasibility and correctness, but I’m pretty satisified with the results.
With over twenty thousand pages, it was no small task and I struggled finding a way to automate it. In relaunching one of our decade old platforms, we couldn’t afford to get bashed by fickle finger of Google and I needed to take extra care in ensuring all redirects were properly made. You can prevent this behavior by changing the settings.Coda My boss threw down the gauntlet Monday morning during our weekly meeting. Sometimes it downloads large files you don’t want or follows links to parts of a site you are not interested in.
HOW TO USE SITE SITESUCKER MANUAL
Make sure you are viewing the manual that matches your version of the software. Interested parties are invited to view the SiteSucker manual.
HOW TO USE SITE SITESUCKER OFFLINE
This makes it possible to browse the offline copy without the internet. The website will have been “localized”, meaning all the links will have been altered to work on your local machine.
You can now click on Open File to open your downloaded website in your default web browser. When the download is finished the window will look the same as before you started except that the Files Downloaded counter will have a value larger than zero. The program will continue to download all the files on the site. The lower half of the screen should start flashing with URLs and filenames. Paste the URL of the website you are interested in downloading in Web URL box, and click download. You should be greeted by the window above. Newer versions are only available through the app store. Get the version for your iteration of OSX.
HOW TO USE SITE SITESUCKER WINDOWS
I will post Windows instructions in the future. Note that these instructions are for a Mac using OSX. If there is a useful website that you want to save for posterity you can use the following method. Contrary to popular belief the web is not static.