{"id":491,"date":"2022-03-11T19:05:15","date_gmt":"2022-03-11T18:05:15","guid":{"rendered":"https:\/\/wp.catedu.es\/zgzsur\/?page_id=491"},"modified":"2022-03-11T19:05:15","modified_gmt":"2022-03-11T18:05:15","slug":"how-to-use-archive-org","status":"publish","type":"page","link":"https:\/\/wp.catedu.es\/zgzsur\/how-to-use-archive-org\/","title":{"rendered":"How to use&#8230; ARCHIVE.ORG"},"content":{"rendered":"\n<p><a href=\"https:\/\/archive.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">The Internet Archive<\/a>, whose web archive is commonly known as the Wayback Machine, allows users to visit archived versions of websites.<\/p>\n<p>The Internet Archive has been archiving sites since 1996 and, at the time of writing, holds 514 billion archived web pages!<\/p>\n<p>If you are wondering how you can use the Internet Archive in your OSINT research, you\u2019ve come to the right place. There are many methods to extract important information from the Wayback Machine to further your OSINT investigations. If you want to see historical versions of a website because the site has been deleted or replaced with new content, the Wayback Machine can help. You may need to verify that a target previously worked at a company when the current version of the site no longer shows the target\u2019s information. Sometimes a target may intentionally hide information from their present website; looking at older captures of the site may reveal it. Sometimes you can gather relevant data like names, phone numbers, email addresses, and even metadata from older versions of a website. 
Let\u2019s explore search methods\u2026<\/p>\n<p><strong>Quick Search Methods:<\/strong><\/p>\n<ul>\n<li>The quickest way to see what has been archived for a particular site is to visit the URL\u00a0<a href=\"https:\/\/web.archive.org\/*\/www.example.com\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/web.archive.org\/*\/www.example.com<\/a>\u00a0and replace\u00a0<a href=\"http:\/\/www.example.com\/\" rel=\"nofollow\">http:\/\/www.example.com<\/a>\u00a0with the site you are interested in.<\/li>\n<li><span style=\"font-size: 14pt;\"><strong>Example: <a href=\"https:\/\/web.archive.org\/web\/*\/www.colegiozaragozasur.es\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/web.archive.org\/web\/*\/www.colegiozaragozasur.es<\/a><\/strong><\/span><\/li>\n<\/ul>\n<p>If the site has been archived, a calendar view will appear with colour-coded dots that have different meanings. <strong>The blue dots<\/strong> are what you\u2019ll want to click on, as they indicate a capture of the web page. <strong>Green<\/strong> indicates a redirect, <strong>orange<\/strong> dots indicate that the crawler received a client error, and <strong>red<\/strong> means there was a server error. 
Navigating the timeline displays the dates on which the site was archived.<\/p>\n<figure class=\"wp-block-image\"><img decoding=\"async\" class=\"jetpack-lazy-image jetpack-lazy-image--handled\" src=\"https:\/\/lh5.googleusercontent.com\/HZ5EZ9KzqXh6DiyQ0bsKw9v4hJ86o64IdCFg-OJmFVIeyFyVIUD7IflopUid5GGvCvGlB2i1nkmY7IaZtF0Fg2TjG6tXzln5dbVDrGPpvkmnQWKUQHiCdFi5fwy3ycPyRbWyCWg5\" alt=\"\" data-lazy-loaded=\"1\" \/><figcaption>Example of the timeline<\/figcaption><\/figure>\n<ul>\n<li>If you want to view all the archived URLs of a particular domain, use the link\u00a0<a href=\"https:\/\/web.archive.org\/*\/www.example.com\/*\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/web.archive.org\/*\/www.example.com\/*<\/a>\u00a0and replace\u00a0<a href=\"http:\/\/www.example.com\/\" rel=\"nofollow\">http:\/\/www.example.com<\/a>\u00a0with the site you are interested in. As shown below, 117 URLs were captured for\u00a0<a href=\"https:\/\/www.osinttechniques.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">www.osinttechniques.com<\/a>. 
Example:\u00a0<a href=\"https:\/\/web.archive.org\/web\/*\/www.colegiozaragozasur.es\/*\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/web.archive.org\/web\/*\/www.colegiozaragozasur.es\/*<\/a><\/li>\n<\/ul>\n<figure class=\"wp-block-image\"><img decoding=\"async\" class=\"jetpack-lazy-image jetpack-lazy-image--handled\" src=\"https:\/\/lh4.googleusercontent.com\/o-_O10dA3JYjmnP0i_0GUz8rLz4D-k4snxZWh2sBcao4AifV-ISFzkzAOurSa-PcgQY1jtONcpW58uLwrMgjH-p98pITpIioNEgk26bqH1CEDPtGm_s8XGDnyA25RQQ4VTEr8r5h\" alt=\"\" data-lazy-loaded=\"1\" \/><figcaption>Example of all the URLs archived from osinttechniques.com<\/figcaption><\/figure>\n<p><strong>Other Search Methods:<\/strong><\/p>\n<ul>\n<li>When you have a URL of interest, you can search for it at\u00a0<a href=\"https:\/\/archive.org\/web\">https:\/\/archive.org\/web<\/a>.<\/li>\n<\/ul>\n<p>Example: search\u00a0<a href=\"https:\/\/www.myspace.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">www.myspace.com<\/a>\u00a0to see how the site has changed over time.<\/p>\n<figure class=\"wp-block-image\"><img decoding=\"async\" class=\"jetpack-lazy-image jetpack-lazy-image--handled\" src=\"https:\/\/lh5.googleusercontent.com\/GrtFRleeh8BOaFlYK-z8GE5X07CwT9srX1UImVUuSPaBvWm1vxLpMSkw6aEPexDsmpMcIe86x7je2Kw6QpaBdSey1djh-AQnl6NnizEvd-6xwUfWcF5U2eLZUloFESGEI5ARaFhD\" alt=\"\" data-lazy-loaded=\"1\" \/><figcaption>Blue dots are the most interesting ones to look at<\/figcaption><\/figure>\n<ul>\n<li>Conduct keyword searches at\u00a0<a href=\"https:\/\/web.archive.org\/web\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/web.archive.org<\/a><\/li>\n<\/ul>\n<p>Example: search for \u201cosama bin laden\u201d to see what results come up, or look up social media users, such as the Facebook profile of Mark Zuckerberg:\u00a0<a href=\"https:\/\/web.archive.org\/web\/*\/www.facebook.com\/zuck\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/web.archive.org\/web\/*\/www.facebook.com\/zuck<\/a><\/p>\n<figure 
class=\"wp-block-image\"><img decoding=\"async\" class=\"jetpack-lazy-image jetpack-lazy-image--handled\" src=\"https:\/\/lh4.googleusercontent.com\/zPUHwH9EuAgsrJJyBA_NqFnX9lngvDFHNcYU9yXw6MSVtpcC1PqnzUlJ4Wd__4J5dPSBI8xDFoEzMYbPCcWa0JhdOeVQwfakNJWGdrNHCqlpoxhox1rFK5YIA31Pg5yq3gMwejLU\" alt=\"\" data-lazy-loaded=\"1\" \/><\/figure>\n<ul>\n<li>Use the advanced search feature at\u00a0<a href=\"https:\/\/archive.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/archive.org<\/a>, or go directly to\u00a0<a href=\"https:\/\/archive.org\/advancedsearch.php\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/archive.org\/advancedsearch.php<\/a>, to perform more targeted searches and sometimes find the email address of the user who uploaded a file.<br \/>\nSome files require you to log in to gain access; this is where you create a fake research account to investigate further:\u00a0<a href=\"https:\/\/archive.org\/account\/signup\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/archive.org\/account\/signup<\/a><\/li>\n<\/ul>\n<ul>\n<li>Use the steps below to find the email address associated with an uploaded file. 
In OSINT research, an email address you identify is another data point you can leverage by searching for it elsewhere, such as in search engines or on social media sites.<\/li>\n<\/ul>\n<p>Example:\u00a0<a href=\"https:\/\/archive.org\/details\/FlintstonesWinstonCigaretteCommericals\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/archive.org\/details\/FlintstonesWinstonCigaretteCommericals<\/a><\/p>\n<ol>\n<li>Scroll down to find \u201cdownload options\u201d.<\/li>\n<li>Click on \u201cshow all\u201d to display all files.<\/li>\n<li>Click on the file that ends with \u201cmeta.xml\u201d.<\/li>\n<li>Press Ctrl+F and search for the word \u201cuploader\u201d to see the email address:\u00a0donkeykongland2@yahoo.com<\/li>\n<\/ol>\n<figure class=\"wp-block-image\"><img decoding=\"async\" class=\"jetpack-lazy-image jetpack-lazy-image--handled\" src=\"https:\/\/lh5.googleusercontent.com\/Xxx9eYWA5N7IHbCL7egAITUOuunaX0yJ7TpcFyUNLGcv9IunhsquqLWXNmyLmW9Ap_w1Dc53I6s4v4Z2yu0dgXku_hsrQAFTv5y2Xv0tdnRNV-rDXB_t3XzUdUrn0KSSyyPwWr80\" alt=\"\" data-lazy-loaded=\"1\" \/><figcaption>Click on the \u2018Show All\u2019 button displayed in the light grey box on the right<\/figcaption><\/figure>\n<figure class=\"wp-block-image\"><img decoding=\"async\" class=\"jetpack-lazy-image jetpack-lazy-image--handled\" src=\"https:\/\/lh3.googleusercontent.com\/DmummvQO-z5DCXHWGdJK8RzQam9p7ciW8mdoJGoGa1NdDt2MR1I-8JTBH5OHnBllBOJ2ZklR2K1_ObG1KYsSZQvuOTdLTz0s9TZghdywC8wTk8cZOEfIJl4L3KJ1xugvDuxfAKCX\" alt=\"\" data-lazy-loaded=\"1\" \/><figcaption>Click on the \u2026meta.xml file in the results.<\/figcaption><\/figure>\n<p><strong>Use Collections and Changes (beta):<\/strong><\/p>\n<ul>\n<li>Collections are a way to learn why a URL has been archived in the Wayback Machine.<\/li>\n<\/ul>\n<p>Example:\u00a0<a href=\"https:\/\/web.archive.org\/web\/collections\/2020*\/osinttechniques.com\" target=\"_blank\" rel=\"noreferrer 
noopener\">https:\/\/web.archive.org\/web\/collections\/2020*\/osinttechniques.com<\/a><\/p>\n<ul>\n<li>Changes allows users to select two different versions of a URL and compare them side by side.<\/li>\n<\/ul>\n<p>Example:\u00a0<a href=\"https:\/\/web.archive.org\/web\/changes\/osinttechniques.com\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/web.archive.org\/web\/changes\/osinttechniques.com<\/a><\/p>\n<p>Learn more about Collections and Changes here:\u00a0<a href=\"https:\/\/blog.archive.org\/2019\/10\/18\/the-wayback-machine-fighting-digital-extinction-in-new-ways\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/blog.archive.org\/2019\/10\/18\/the-wayback-machine-fighting-digital-extinction-in-new-ways<\/a><\/p>\n<p><strong>Saving Pages:<\/strong><\/p>\n<ul>\n<li>Use\u00a0<a href=\"https:\/\/archive.org\/web\/\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/archive.org\/web\/<\/a>\u00a0to request that a page be archived: the save button is visible at the bottom right of the screen, or you can go directly to\u00a0<a href=\"https:\/\/web.archive.org\/save\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/web.archive.org\/save<\/a>. This \u201cSave Page Now\u201d option captures only that particular page, not the entire website, and only works for sites that allow crawlers. The screenshot below shows an article from OSINT Curious being saved to the archive.<\/li>\n<\/ul>\n<figure class=\"wp-block-image\"><img decoding=\"async\" class=\"jetpack-lazy-image jetpack-lazy-image--handled\" src=\"https:\/\/lh5.googleusercontent.com\/BdmDvH8k61b7pj8pRrW2HET8beIFr2wXr5hBiqM71VT2WMNz8jfRBzt68sCCMjMlbIfpgGIlhfWKtN3Ddvb4hVTTRL5s8iGrZitHywMcGmguaogeIvjyjpNR-zhmzYic7tAF9R6U\" alt=\"\" data-lazy-loaded=\"1\" \/><\/figure>\n<p>For sourcing purposes, it may be important to know when something was saved by the Internet Archive. 
Let\u2019s look at the link below:<\/p>\n<p><a href=\"https:\/\/web.archive.org\/web\/20180214034336\/http:\/\/www.osinttechniques.com\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/web.archive.org\/web\/20180214034336\/http:\/\/www.osinttechniques.com<\/a><\/p>\n<p>The number in the middle has the format yyyymmddhhmmss, so the site was crawled on February 14, 2018 at 03:43:36 (Wayback Machine timestamps are in UTC).<\/p>\n<p>What if the site you are investigating isn\u2019t on the Internet Archive? Some sites will not be on Archive.org because of robots.txt files or because a website owner has requested that their site not be archived.<\/p>\n<p>However, you have other options, such as searching for cached content as described in this blog post\u00a0<a href=\"https:\/\/osintcurio.us\/2019\/02\/12\/osint-on-deleted-content\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/osintcurio.us\/2019\/02\/12\/osint-on-deleted-content<\/a>\u00a0or checking other online archives such as\u00a0<a href=\"https:\/\/archive.today\/\" target=\"_blank\" rel=\"noreferrer noopener\" data-type=\"URL\" data-id=\"https:\/\/archive.today\">archive.today<\/a>.<\/p>\n<p>You can use all these resources any way you want, but of course they are an invaluable resource for those of us who work in SEO.<\/p>\n<p>Have a nice weekend, my friends.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The Internet Archive, whose web archive is commonly known as the Wayback Machine, allows users to visit archived versions of websites. 
The Internet Archive has been archiving sites since 1996 and has 514 billion&#8230;<\/p>\n","protected":false},"author":355,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_s2mail":"","footnotes":""},"class_list":["post-491","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/wp.catedu.es\/zgzsur\/wp-json\/wp\/v2\/pages\/491","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wp.catedu.es\/zgzsur\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/wp.catedu.es\/zgzsur\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/wp.catedu.es\/zgzsur\/wp-json\/wp\/v2\/users\/355"}],"replies":[{"embeddable":true,"href":"https:\/\/wp.catedu.es\/zgzsur\/wp-json\/wp\/v2\/comments?post=491"}],"version-history":[{"count":2,"href":"https:\/\/wp.catedu.es\/zgzsur\/wp-json\/wp\/v2\/pages\/491\/revisions"}],"predecessor-version":[{"id":494,"href":"https:\/\/wp.catedu.es\/zgzsur\/wp-json\/wp\/v2\/pages\/491\/revisions\/494"}],"wp:attachment":[{"href":"https:\/\/wp.catedu.es\/zgzsur\/wp-json\/wp\/v2\/media?parent=491"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}