Movatterモバイル変換


[0]ホーム

URL:


Wayback Machine
31 captures
11 Sep 2007 - 21 Jan 2022
MayJUNJul
06
201020112012
success
fail
COLLECTED BY
Organization:Internet Archive
The Internet Archive discovers and captures web pages through many different web crawls.At any given time several distinct crawls are running, some for months, and some every day or longer.View the web archive through theWayback Machine.

Web wide crawl with initial seedlist and crawler configuration from March 2011. This uses the new HQ software for distributed crawling by Kenji Nagahashi.

What’s in the data set:

Crawl start date: 09 March, 2011
Crawl end date: 23 December, 2011
Number of captures: 2,713,676,341
Number of unique URLs: 2,273,840,159
Number of hosts: 29,032,069

The seed list for this crawl was a list of Alexa’s top 1 million web sites, retrieved close to the crawl start date. We used Heritrix (3.1.1-SNAPSHOT) crawler software and respected robots.txt directives. The scope of the crawl was not limited except for a few manually excluded sites.

However this was a somewhat experimental crawl for us, as we were using newly minted software to feed URLs to the crawlers, and we know there were some operational issues with it. For example, in many cases we may not have crawled all of the embedded and linked objects in a page since the URLs for these resources were added into queues that quickly grew bigger than the intended size of the crawl (and therefore we never got to them). We also included repeated crawls of some Argentinian government sites, so looking at results by country will be somewhat skewed.

We have made many changes to how we do these wide crawls since this particular example, but we wanted to make the data available “warts and all” for people to experiment with. We have also done somefurther analysis of the content.

If you would like access to this set of crawl data, please contact us at info at archive dot org and let us know who you are and what you’re hoping to do with it. We may not be able to say “yes” to all requests, since we’re just figuring out whether this is a good idea, but everyone will be considered.

TIMESTAMPS
loading
The Wayback Machine - https://web.archive.org/web/20110606063526/http://db.tidbits.com/section/tech
 
Logged In!
To log in, enter your info below.
Email
Password

Visit

Submitted by


Thanks for submitting your tip! All submissions are moderated by an editor before appearing online. We've reset the form so you can enter another tip. Or you can close the tip submission box.
Close
TipBITS Submission
Please submit only technical tips that will help other TidBITS readers better use their Macs, iPhones, iPads, and related software and hardware. All product announcements should be sent toreleases@tidbits.com.
Tip title*
Your tip*
URL

Enter the URL to a Web page that supports your tip.
Linked text

Enter the name of the page linked above.
Your name*
Your email*
* indicates required fields
To help us avoid automated posts and spam, please enter the words below.
 
When you submit a tip, you give us permission to use it. Readour terms for more details. All submissions are reviewed before publication.
Our terms: By submitting a tip, you agree to assign TidBITS Publishing Inc., a non-exclusive, worldwide, perpetual license to reproduce, publish, and distribute your tip in connection with the TidBITS Web site and associated products in any media. You agree that you created the content you submitted, and that you have the right to assign us this license. You give us permission to use your name, but your email address won't be publicly displayed or shared. We review all submissions before publication, and reserve the right to select which submissions we feel are appropriate for our readers and to edit those we publish.
Close
Close

Thanks for submitting a comment! Please check your email for a link that, when clicked, will verify that you're a real person and cause your comment to appear immediately.
Close
Post a Comment
In reply to
Your comment

Remaining characters:0
Your name
Your email
 
Our terms: We reserve the right to edit or delete any comment, so please post thoughtfully. We use your email addressonly to send you a one-time verification message confirming that you posted this comment. We also store your address to allow you to verify using other Web browsers in the future. For more info, see ourprivacy policy.
Close
TidBITS: Apple News for the Rest of Us
  •    
  •  
 
Thoughtful, detailed coverage of the Mac, iPhone, and iPad, plus the best-sellingTake Control ebooks.

 

 
Set Password Activation Time in Snow Leopard

In Snow Leopard, you can now set an amount of time after your Mac goes to sleep or engages the screen saver before it requires a password to log back on. In Leopard, the option was simply to require the password or not. Choose among several increments, between 5 seconds and 4 hours, from System Preferences > Security.

Submitted by
Doug McLean

 
 
 

RSSTechnology News

Stay up to date with the latest tech news from Apple, Microsoft, Google, and other key players.

Apple to Unveil Lion, iOS 5, and iCloud in WWDC Keynote

In a brief press release, Apple has announced that Steve Jobs and a team of Apple executives would deliver the keynote at next week’s Worldwide Developer Conference, unveiling Mac OS X Lion, iOS 5, and the company’s much-rumored cloud-services offering: iCloud.

byAdam C. Engst | 31 May 2011 |3 comments
 
Show summaries of articles

Guy Kawasaki to Keynote MacTech Conference 2011

MacTech Magazine has announced that Guy Kawasaki will be keynoting the MacTech Conference 2011 in November.
byAdam C. Engst | 03 Jun 2011 |0 comments

Once again, Adam joined Andy Ihnatko, Chris Breen, John Gruber, and Leo Laporte for MacBreak Weekly, where the tremendously enjoyable conversation ranged from geek hair care (Gruber relies on a Perl script!) to Apple Legal’s response to Lodsys to what we might see in a forthcoming iCloud music service from Apple.

Amazon Takes Aim at the Mac App Store

In an unexpected move, Amazon has taken the wraps off the Mac Downloads Store. It doesn’t have the selection or integration of the Mac App Store, but Apple suddenly has some competition that didn’t exist yesterday.
byAdam C. Engst | 26 May 2011 |9 comments

Apple Responds to Increasingly Serious MACDefender Situation

In an unusual about-face, Apple has gone from telling AppleCare reps not to help callers remove MACDefender infections to releasing a support document explaining how to do it and promising an update to Mac OS X to stop MACDefender and its variants. Hopefully it will also stop MacGuard, the new variant Intego has found that doesn’t even require an admin password to install.
byAdam C. Engst | 25 May 2011 |16 comments

Apple Tells Lodsys That iOS Developers Are Covered

iOS developers are breathing easier after Apple Legal has weighed in on the Lodsys patent infringement claims, asserting that developers are indeed covered by Apple’s license of the Lodsys patents.

The company behind the popular 1Password utility has changed its name from Agile Web Solutions (which didn’t really reflect what they did) to the pithier and more accurate AgileBits. It’s not big news, but in a day and age where there are all sorts of phishing scams and the like, knowing the true identity of our tool makers is important.

The U.S. Library of Congress has opened the National Jukebox, a Web site that offers Flash-based streaming access to over 10,000 78 rpm discs issued by the Victor Talking Machine Company between 1900 and 1925. (The discs have been made available thanks to blanket permission from the rights-holder, Sony Music.) Thousands more are slated to be added soon. This is one of the great promises of the Internet, though it’s worth noting that these recordings can’t be downloaded because sound recordings published before 1972 are subject to state and/or common law protection, not Federal copyright law, and thus won’t fully enter the public domain until 2067.

Small iOS Developers Targeted over In-App Purchase Patents

Several small iOS developers have received patent infringement claims for using Apple’s in-app purchasing in their apps.

Microsoft to Buy Skype for $8.5 Billion

In the largest-ever deal for Microsoft, the company has agreed to buy Internet telephony company Skype for $8.5 billion. Microsoft promises to continue to invest in Skype clients for non-Microsoft platforms.

Beware Fake MACDefender Antivirus Software

Intego has identified a new piece of malware called MACDefender that masquerades as an antivirus program in an attempt to sucker users into paying for the program, likely putting credit card numbers at risk for being stolen as well.

 

Dragon speech recognition software for Macintosh, iPhone, and iPad!
Get the all-new Dragon Dictate for Mac from Nuance Communications
and experience Simply Smarter Speech Recognition.
Learn more about Dragon Dictate: <http://nuance.com/dragon/mac>
 

Macs Targeted by New “Crimekit”

The appearance of a new “crimekit” aimed at Mac OS X users is cause for concern, but not yet for a change in behavior, assuming, of course, that you were behaving cautiously up to this point.

The news site Ars Technica (owned by Condé Nast Digital) woke up Thursday morning to find their Facebook page locked after an unknown person complained to Facebook that some piece of Ars Technica content infringed on their rights. With no warning, explanation, or clear appeal process, and with only minimal communication after Ars staffers started to investigate, the Ars Technica Facebook page remained inaccessible the entire day. (It has now been restored, with a statement from Facebook apologizing weakly and justifying the action.) If this can happen to a major news outlet like Ars Technica, which can bring corporate resources to bear on resolving the situation, just imagine how much fun it would be for a small business. Moral of the story: Do not rely on Facebook for anything critical to your business. Facebook is not your friend.

Apple Addresses Location Controversy Questions

Attempting to put the controversy over iPhone location tracking to rest, Apple has released a Q&A; that answers the primary questions about what the iPhone is doing and why. Apple even acknowledged that there are some bugs in iOS that will be fixed in a forthcoming update.

Apple Breaks More Records for Q2 2011

Apple reporting billions of dollars in revenue and profit for its last financial quarter? No surprise there. But where that money came from may raise some eyebrows.

Apple Previews Final Cut Pro X: New, Faster, and Cheaper

Apple previewed Final Cut Pro X at the National Association of Broadcasters Show in Las Vegas, showing off new features and promising a $299 price through the Mac App Store for when it ships in June.
byLex Friedman | 13 Apr 2011 |5 comments

MacTech Boot Camps Fast Approaching

Consultants and support techs take note — the MacTech Boot Camp conferences are upon us, with the Dallas and Boston events within the next month. Read on for special discounts for TidBITS readers!

Backup Deals for World Backup Day

Thursday, March 31st has been dubbed World Backup Day, and a bunch of online backup providers have special deals in the works.
byAdam C. Engst | 30 Mar 2011 |10 comments

While WWDC is aimed primarily at developers and provides the Apple party line, those interested in an independent approach should look at the 2011 MacTech Conference, scheduled for November 2nd through 4th in Los Angeles. The hotel-based MacTech Conference offers two tracks, one focused on IT topics, the other on Mac and iOS development. Conference registration costs $999, but a limited number of early-bird slots are available for $799.
28 Mar 2011

Apple has announced that the company’s 2011 Worldwide Developer Conference will be held June 6th through 10th at Moscone West in San Francisco. Conference passes cost $1,599, but no more are available since the conference sold out in the first day.
28 Mar 2011

Judge Rejects Google Books Settlement

A judge has rejected a sweeping settlement of lawsuits against Google that would have given the search company a kind of monopoly, and created a cartel for pricing by authors and publishers. Good.

Next 20 articles for Technology News...

 

THE MISSING SYNC FOR ANDROID: Sync with iTunes, Address Book,
iCal, iPhoto and other Mac applications. Supports HTC EVO,
Motorola Droid, Droid X, Droid 2, Google Nexus One and many
other phones. <http://www.markspace.com/bits>
 
 

TidBITS is copyright © 2011 TidBITS Publishing Inc.

Reuse governed byCreative Commons License.

About TidBITS |Advertise with TidBITS! |Contact Info |Copyright Terms

 

[8]ページ先頭

©2009-2026 Movatter.jp