PDA

View Full Version : Google Sitemaps


clasione
06-04-2005, 02:02 AM
WEBPRONEWS ARTICLE,

Recently, Google launched Sitemaps (https://www.google.com/webmasters/sitemaps/login), a "collaborative crawling" service designed to keep Google informed of modifications to your web site so their search index can reflect these changes… or as Rusty call it (http://www.seroundtable.com/archives/002034.html), a free pay-for-inclusion program.

Sitemaps works by taking advantage of XML and RSS capabilities. By placing XML code on the web server, you inform Google of when changes occur, and they respond by crawling the updated pages and making the necessary updates to the search index. Over at the Google Blog (http://googleblog.blogspot.com/2005/06/webmaster-friendly.html), Engineering Director Shiva Shivakumar indicated why Google launched Sitemaps:

"Initially, we plan to use the URL information webmasters supply to further improve the coverage and freshness of our index. Over time that will lead to our doing an even better job of delivering more search results from more websites."

Shiva also gave an extensive interview to Danny Sullivan over at the SearchEngineWatch Blog (http://blog.searchenginewatch.com/blog/050602-195224). In it, Shiva iterates that the Sitemaps program's current beta state; he won't guarantee each submitted URL would be crawled. He did indicate that this was something they were working toward, however.

As mentioned, in order to participate in Sitemaps, you have to have a Google account and you have to place an XML file on the webserver being used by your site. This is done in order to inform Google's crawlers of what URLs to look for and how often these pages change. As pointed out by Rusty, over at SocialPatterns.com (http://www.socialpatterns.com/search-engine-optimization/breaking-down-google-sitemaps-xml/), SEM Michael Nguyen broke down an example of his Sitemaps' XML code, line-by-line; in order to shed some light on what's actually being done.

The XML file must also the URL of each page you want to be in the Sitemaps program. If you have four pages that undergo frequent change, all 4 page URLs should be listed, if you have an entire site that you want included, you have to include the URL of each page. By employing the changefreq and priority XML tags, you can also indicate how important each page is and how frequently the page changes.

After the XML is complete, you must submit it to the Sitemaps program. This is where the Google account comes in. Once the URL of the sitemap is submitted, your task is complete.

There are a couple of methods you can use in order to get an XML sitemap. A sitemap generator (https://www.google.com/webmasters/sitemaps/docs/en/sitemap-generator.html) can be downloaded from Google or you can develop one. The generator is an open-source Python file that has to be uploaded to the webserver. According to their FAQ the sitemap generator "can create sitemaps from URL lists, webserver directories, or from access logs."

You can also develop your own XML sitemap (https://www.google.com/webmasters/sitemaps/docs/en/protocol.html) if you so choose. This will have to be submitted as well. The final method Google accepts is a text file containing the URLs you want in the program. Obviously, this method is saved for those who have little experience dealing with webservers or structural web alterations. It also seems like the text files will be given the lowest priority, at least until the program is off and running.

As to whether or not you should be taking part in the Google Sitemaps program is quite simple: if search engines play any role in your business whatsoever, you should be apart of the program. Having Google's (or any other search engine for that matter) index reflect changes in your site quickly will only benefit your search engine presence. Or as Nathan Weinberg (http://google.blognewschannel.com/index.php/archives/2005/06/02/google-begins-website-update-reporting-service/) says, it'd be stupid not too.

An additional area of interest is that Google made Sitemaps as open-source as possible… at least on the XML end. By making the sitemap generator in Python and releasing it under the Attribution/Share Alike Creative Commons license (http://creativecommons.org/licenses/by-sa/2.0/), Google is only furthering their embrace of open-source. This also allows the program to be adapted in order to support other search engines.

Update: Someone who emailed (http://www.incendiary.ws/node/94) me installed the sitemap generator on his webserver, and evidently the server went boom... or it at least was overtaxed. Here's a quote from his post discussing the event: "Running it brought down my 3200MHz Pentium 4 running Debian Linux and 2 GB of RAM."

Read Theo's report (http://www.incendiary.ws/node/94) and see for yourselves.

About the Author:
Chris Richardson is a search engine writer for WebProNews (http://www.webpronews.com/). Visit WebProNews for the latest search news (http://www.webpronews.com/).

_______________________________________

I'm not overly exited about this site progeram just yet. Although it is great to be able to attract Google or any other search engine on a frequent basis, if your site is strong enough and updated enough the search engines will be back to crawl it whether or not you are using the Google sitemap program.

I'm more concerned at what benefits this program serves to Google and exactly why they are launching it when they have so much other things going on...

The only thing I could see happening by participating in this program, is that Google will now know for sure exactly which website is concerned with search engine rankings and which simply don't care....

hmm, that could be worth more to them than knowing they have YOUR most recent data in there index....

It's not something I will do anytime soon. My job is getting that crawler to my site daily regardless becasue my site and content warrent it.

It could be something I move to in the future, but for now, I don't immediatly see the benefit.

Anthony Parsons
06-05-2005, 10:54 AM
Yep, agreed. I don't see any real benefit to the program unless your site is very very large and googlebot is chewing bandwidth to the extent it is costing you considerably. In most cases, it shouldn't affect you. I think probably only 1% of the webs actual sites would really call for such a service. For those large sites, news and media, etc etc, that really warrant some control over the SE bots, absolutely. For something like Slashdot, using the sitemap program could save them thousands of $$$ a year in bandwidth that googlebot chews up spidering the same content over and over, when in fact, it really only needs to spider new and updated content.

I know most people are going to use it, and spruke some sort of ranking advantage, blah blah blah, but it won't actually impact them at all IMHO.

clasione
06-05-2005, 11:09 AM
yea, I'm quite surprized they said this:

"Or as Nathan Weinberg (http://google.blognewschannel.com/index.php/archives/2005/06/02/google-begins-website-update-reporting-service/) says, it'd be stupid not too."

I completely disagree with that statement.....

kservik
06-08-2005, 04:14 AM
yea, I'm quite surprized they said this:

"Or as Nathan Weinberg (http://google.blognewschannel.com/index.php/archives/2005/06/02/google-begins-website-update-reporting-service/) says, it'd be stupid not too."

I completely disagree with that statement.....

It might still give the advantage of faster indexing and setting google to watch special pages more closely.

clasione
06-08-2005, 04:17 AM
There definetly could be some benefits, but nothing that sparks my interest just yet....

Searchen Networks Forum Archive - Forums | Directory | What's New | Popular | Add Listing | Edit Listing | Company | Domains | Hosting | Marketing | Terms | Search

Arts
Animation, Antiques, Architecture...
Business
Accounting_, Aerospace_and_Defense, Agriculture_and_Forestry...
Computers
Algorithms, Artificial_Intelligence, Artificial_Life...
Games
Board_Games, Card_Games, Computer_Games...
Health
Addictions, Aging, Alternative...
Home
Apartment_Living, Consumer_Information, Cooking...
Kids_and_Teens
Arts, Computers, Directories...
News
Breaking_News, Chats_and_Forums, Current_Events...
Recreation
Antiques, Audio, Autos...
Reference
Almanacs, Archives, Ask_an_Expert...
Science
Agriculture, Anomalies_and_Alternative_Science, Astronomy...
Shopping
Antiques_and_Collectibles, Auctions, Autos...
Society
Activism, Advice, Crime...
Sports
Baseball, Basketball, Billiards...
United_States
Alabama, Alaska, Arizona...
World
Abkhazia, Afghanistan, Albania...

 

Animation | Antiques | Architecture | Archives | Art_History | Awards | Bodyart | Chats_and_Forums | Classical_Studies | Comics | Costumes | Crafts | Cultures_and_Groups | Dance | Design | Digital | Directories | Education | Entertainment | Events | Genres | Graphic_Design | Humanities | Illustration | Libraries | Literature | Magazines | Movies | Museums | Music | Myths_and_Folktales | Native_and_Tribal | News_and_Media | Online_Writing | Organizations | People | Performing_Arts | Periods_and_Movements | Photography | Radio | Regional | Rhetoric | Television | Theatre | Typography | Video | Visual_Arts | Writers_Resources

Accounting_ | Aerospace_and_Defense | Agriculture_and_Forestry | Arts_and_Entertainment_ | Associations_ | Automotive_ | Biotechnology_and_Pharmaceuticals_ | Business_and_Society_ | Business_Law | Business_Services | Business_Travel | Chemicals_ | Classifieds_ | Construction_and_Maintenance_ | Consumer_Goods_and_Services_ | Cooperatives_ | Customer_Service_ | Dictionaries_ | Directories_ | E-Commerce | Education_and_Training | Electronics_and_Electrical_ | Employment_ | Energy_and_Environment | Financial_Services | Food_and_Related_Products | Healthcare | History | Hospitality | Human_Resources | Industrial_Goods_and_Services | Information_Technology | Insurance | International_Business_and_Trade | Investing | Major_Companies | Management | Marketing_and_Advertising | Mining_and_Drilling | News_and_Media | Opportunities | Publishing_and_Printing | Real_Estate | Resources | Retail_Trade | Small_Business | Software | Telecommunications | Textiles_and_Nonwovens | Transportation_and_Logistics | Wholesale_Trade

Algorithms | Artificial_Intelligence | Artificial_Life | Bulletin_Board_Systems | CAD_and_CAM | Chats_and_Forums | Companies | Computer_Science | Consultants | Data_Communications | Data_Formats | Desktop_Publishing | Directories | E-Books | Education | Employment | Emulators | Ethics | Fonts | Games | Graphics | Hacking | Hardware | History | Home_Automation | Human-Computer_Interaction | Internet | Intranet | Mailing_Lists | MIS | Mobile_Computing | Multimedia | Newsgroups | News_and_Media | Open_Source | Operating_Systems | Organizations | Parallel_Computing | Performance_and_Capacity | Product_Support | Programming | Repair | Robotics | Security | Shopping | Software | Speech_Technology | Supercomputing | Systems | Usenet | Virtual_Reality

Arcade Games | Board_Games | Card_Games | Computer_Games | Console_Games | Developers_and_Publishers | Dice | Gambling | Game_Studies | Online | Party_Games | Puzzles | Resources | Roleplaying | Shopping | Trading_Card_Games | Video_Games

Addictions | Aging | Alternative | Animal | Beauty | Child_Health | Conditions_and_Diseases | Dentistry | Directories | Disabilities | Education | Employment | Environmental_Health | Fitness | Hair_Care | Healthcare_Industry | History | Home_Health | Insurance | Medicine | Men's_Health | Mental_Health | News_and_Media | Nursing | Nutrition | Occupational_Health_and_Safety | Organizations | Pharmacy | Products_and_Shopping | Professions | Publications | Public_Health_and_Safety | Reproductive_Health | Resources | Senior_Health | Senses | Services | Specific_Substances | Support_Groups | Teen_Health | Weight_Loss | Women's_Health

Apartment_Living | Consumer_Information | Cooking | Do-It-Yourself | Domestic_Services | Emergency_Preparation | Entertaining | Family | Gardening | Homemaking | Homeowners_ | Home_Automation | Home_Business | Home_Buyers | Home_Improvement_ | Moving_and_Relocating | News_and_Media_ | Personal_Finance_ | Personal_Organization_ | Pets_ | Rural_Living_ | Seniors_ | Shopping_ | Software_ | Urban_Living_

Arts | Computers | Directories | Entertainment | Games | Health | International | News | People_and_Society | Pre-School | School_Time | Sports_and_Hobbies | Teen_Life | Your_Family | Breaking_News | Chats_and_Forums | Current_Events | Directories | Extended_Coverage | Internet_Broadcasts | Journalism | Journals | Magazines_and_E-zines | Media | Newspapers | Online_Archives | Politics | Radio | Services | Sports | Television | Weather | Weblogs

Antiques | Audio | Autos | Aviation | Birding | Boating | Bowling | Camps | Climbing | Collecting | Crafts | Directories | Fireworks | Fishing | Gambling | Games | Gardens | Genealogy | Guns | Horoscopes | Humor | Knives | Martial_Arts | Models | Motorcycles | Outdoors | Scouting | Sports | Theme_Parks | Travel

Almanacs | Archives | Ask_an_Expert | Bibliography | Biography | Books | Dictionaries | Directories | Education | Encyclopedias | Flags | Geography | Journals | Knots | Knowledge_Management | Libraries | Maps | Museums | Open_Access_Resources | Parliamentary_Procedure | Questions_and_Answers | Quotations | Scientific_Reference | Style_Guides | Thesaurus | Time | World_Records

Agriculture | Anomalies_and_Alternative_Science | Astronomy | Biology | Chats_and_Forums | Chemistry | Conferences | Directories | Earth_Sciences | Educational_Resources | Employment | Environment | History_of_Science | Institutions | Instruments_and_Supplies | Math | Methods_and_Techniques | Museums | News_and_Media | Philosophy_of_Science | Physics | Publications | Reference | Science_in_Society | Search_Engines | Social_Sciences | Software | Technology | Women

Antiques_and_Collectibles | Auctions | Autos | Beauty_Products | Books | Children | Classifieds | Clothing | Computers | Consumer_Electronics | Crafts | Credit_Services | Death_Care | Directories | Education | Entertainment | Flowers | Food | Furniture | General_Merchandise | Gifts | Health | Holidays | Home_and_Garden | Jewelry | Music | Niche | Office_Products | Pets | Photography | Publications | Recreation | Religious | Sports | Tobacco | Tools | Toys_and_Games | Travel | Vehicles | Weddings | Wholesale

Activism | Advice | Crime | Death | Disabled | Economics | Education | Ethnicity | Folklore | Future | Gay,_Lesbian,_and_Bisexual | Genealogy | Government | History | Holidays | Issues | Language_and_Linguistics | Law | Lifestyle_Choices | Men | Military | Organizations | Paranormal | People | Philanthropy | Philosophy | Politics | Relationships | Religion_and_Spirituality | Sexuality | Social_Sciences | Sociology | Subcultures | Support_Groups | Transgendered | Urban_Legends | Women | Work

Baseball | Basketball | Billiards | Bowling | Boxing | Cheerleading | Cycling | Darts | Equestrian | Extreme_Sports | Fishing | Football | Golf | Greyhound_Racing | Gymnastics | Hockey | Horse_Racing | Hunting | Lacrosse | Martial_Arts | Motorsports | Racquetball | Shooting | Skateboarding | Soccer | Softball | Swimming | Tennis | Track_and_Field | Volleyball | Wrestling

Alabama | Alaska | Arizona | Arkansas | California | Colorado | Connecticut | Delaware | Florida | Georgia | Guides_and_Directories | Hawaii | Idaho | Illinois | Indiana | Iowa | Kansas | Kentucky | Louisiana | Maine | Maps_and_Views | Maryland | Massachusetts | Michigan | Minnesota | Mississippi | Missouri | Montana | Nebraska | Nevada | New_Hampshire | New_Jersey | New_Mexico | New_York | North_Carolina | North_Dakota | Ohio | Oklahoma | Oregon | Pennsylvania | Rhode_Island | South_Carolina | South_Dakota | Tennessee | Texas | Utah | Vermont | Virginia | Washington | Washington,_DC | West_Virginia | Wisconsin | Wyoming

Afghanistan | Albania | Algeria | Andorra | Angola | Antigua | Argentina | Armenia | Australia | Austria | Azerbaijan | Bahamas | Bahrain | Bangladesh | Barbados | Belarus | Belgium | Belize | Benin | Bhutan | Bolivia | Bosnia | Botswana | Brazil | Brunei | Bulgaria | Burkina | Burundi | Cambodia | Cameroon | Canada | Cape_Verde | Chad | Chile | China | Colombia | Comoros | Congo | Costa_Rica | Cote_d'Ivoire | Croatia | Cuba | Cyprus | Czech_Republic | Denmark | Djibouti | Dominica | Dominican_Republic | East_Timor | Ecuador | Egypt | El_Salvador | Equatorial_Guinea | Eritrea | Estonia | Ethiopia | Fiji | Finland | France | Gabon | Gambia | Georgia | Germany | Ghana | Greece | Grenada | Guatemala | Guinea | Guyana | Haiti | Honduras | Hungary | Iceland | India | Indonesia | Iran | Iraq | Ireland | Israel | Italy | Jamaica | Japan | Jordan | Kazakhstan | Kenya | Kiribati | Korea | Kuwait | Kyrgyzstan | Laos | Latvia | Lebanon | Lesotho | Liberia | Libya | Liechtenstein | Lithuania | Luxembourg | Macedonia | Madagascar | Malawi | Malaysia | Maldives | Mali | Malta | Marshall_Islands | Mauritania | Mauritius | Mexico | Micronesia | Moldova | Monaco | Mongolia | Morocco | Mozambique | Myanmar | Nagorno | Namibia | Nauru | Nepal | Netherlands | New_Zealand | Nicaragua | Niger | Nigeria | Northern_Cyprus | Norway | Oman | Pakistan | Palau | Palestine | Panama | Papua_New_Guinea | Paraguay | Peru | Philippines | Poland | Portugal | Qatar | Romania | Russia | Rwanda | Saint_Kitts_and_Nevis | Saint_Lucia | Saint_Vincent_and_the_Grenadines | Samoa | San_Marino | Sao_Tome | Saudi_Arabia | Senegal | Serbia | Seychelles | Sierra_Leone | Singapore | Slovakia | Slovenia | Solomon_Islands | Somalia | Somaliland | South_Africa | South_Ossetia | Spain | Sri_Lanka | Sudan | Suriname | Swaziland | Sweden | Switzerland | Syria | Taiwan | Tajikistan | Tanzania | Thailand | Togo | Tonga | Transnistria | Trinidad | Tunisia | Turkey | Turkmenistan | Tuvalu | Uganda | Ukraine | United_Arab_Emirates | United_Kingdom | Uruguay | Uzbekistan | Vanuatu | Vatican_City | Venezuela | Vietnam | Western_Sahara | Yemen | Zambia | Zimbabwe |