Friday, July 23, 2010

What The Search Engines Say - Google - Part II

When your site is ready:

• Once your site is online, submit it to Google at http://www.google.com/addurl.html.
• Make sure all the sites that should know about your pages are aware your site is online.
• Submit your site to relevant directories such as the Open Directory Project and Yahoo!.
• Periodically review Google's webmaster section for more information.

Quality Guidelines - Basic principles:

• Make pages for users, not for search engines. Don't deceive your users, or present different content to search engines than you display to users.
• Avoid tricks intended to improve search engine rankings. A good rule of thumb is whether you'd feel comfortable explaining what you've done to a website that competes with you. Another useful test is to ask, "Does this help my users? Would I do this if search engines didn't exist?"
• Don't participate in link schemes designed to increase your site's ranking or PageRank. In particular, avoid links to web spammers or "bad neighbourhoods" on the web as your own ranking may be affected adversely by those links.
• Don't use unauthorized computer programs to submit pages, check rankings, etc. Such programs consume computing resources and violate our terms of service. Google does not recommend the use of products such as Webposition Gold™ that send automatic or programmatic queries to Google.

Quality Guidelines - Specific recommendations:

• Avoid hidden text or hidden links.
• Don't employ cloaking or sneaky redirects.
• Don't send automated queries to Google.
• Don't load pages with irrelevant words.
• Don't create multiple pages, sub domains, or domains with substantially duplicate content.
• Avoid "doorway" pages created just for search engines, or other "cookie cutter" approaches such as affiliate programs with little or no original content.

These quality guidelines cover the most common forms of deceptive or manipulative behaviour, but Google may respond negatively to other misleading practices not listed here, (e.g. tricking users by registering misspellings of well-known web sites). It's not safe to assume that just because a specific deceptive technique isn't included on this page, Google approves of it. Webmasters who spend their energies upholding the spirit of the basic principles listed above will provide a much better user experience and subsequently enjoy better ranking than those who spend their time looking for loopholes they can exploit.
If you believe that another site is abusing Google's quality guidelines, please report that site at http://www.google.com/contact/spamreport.html. Google prefers developing scalable and automated solutions to problems, so we attempt to minimize hand-to-hand spam fighting. The spam reports we receive are used to create scalable algorithms that recognize and block future spam attempts.

Sunday, July 18, 2010

What The Search Engines Say - Google - Part I

Following these guidelines will help Goggle find, index, and rank your site, which is the best way to ensure you'll be included in Google's results. Even if you choose not to implement any of these suggestions, we strongly encourage you to pay very close attention to the "Quality Guidelines," which outline some of the illicit practices that may lead to a site being removed entirely from the Google index. Once a site has been removed, it will no longer show up in results on Google.com or on any of Google's partner sites.

Design and Content Guidelines:

• Make a site with a clear hierarchy and text links. Every page should be reachable from at least one static text link.
• Offer a site map to your users with links that point to the important parts of your site. If the site map is larger than 100 or so links, you may want to break the site map into separate pages.
• Create a useful, information-rich site and write pages that clearly and accurately describe your content.
• Think about the words users would type to find your pages, and make sure that your site actually includes those words within it.
• Try to use text instead of images to display important names, content, or links. The Google crawler doesn't recognize text contained in images.
• Make sure that your TITLE and ALT tags are descriptive and accurate.
• Check for broken links and correct HTML.
• If you decide to use dynamic pages (i.e., the URL contains a '?' character), be aware that not every search engine spider crawls dynamic pages as well as static pages. It helps to keep the parameters short and the number of them small.
• Keep the links on a given page to a reasonable number (fewer than 100).

Technical Guidelines:

• Use a text browser such as Lynx to examine your site, because most search engine spiders see your site much as Lynx would. If fancy features such as JavaScript, cookies, session ID's, frames, DHTML, or Flash keep you from seeing all of your site in a text browser, then search engine spiders may have trouble crawling your site.
• Allow search bots to crawl your sites without session ID's or arguments that track their path through the site. These techniques are useful for tracking individual user behaviour, but the access pattern of bots is entirely different. Using these techniques may result in incomplete indexing of your site, as bots may not be able to eliminate URLs that look different but actually point to the same page.
• Make sure your web server supports the If-Modified-Since HTTP header. This feature allows your web server to tell Google whether your content has changed since we last crawled your site. Supporting this feature saves you bandwidth and overhead.
• Make use of the robots.txt file on your web server. This file tells crawlers which directories can or cannot be crawled. Make sure it's current for your site so that you don't accidentally block the Googlebot crawler. Visit http://www.robotstxt.org/wc/faq.html for a FAQ answering questions regarding robots and how to control them when they visit your site.
• If your company buys a content management system, make sure that the system can export your content so that search engine spiders can crawl your site.

Wednesday, July 7, 2010

Getting Indexed

Getting Indexed

Once your site is listed in a search engine then you are “indexed”. To be indexed you can ask other sites that are already indexed to have a link pointing to you, get listed in a directory (detailed later) or manually submit your site to the search engines (only needs doing once)

There seems to be little or no logic to the time frame of being indexed so treat the search engines as temperamental beings and you will keep your sanity. Some sites will be listed in a matter of days others will take months.

You may notice that I continually talk about Goggle mostly even though there are other major search engines out there. This is because figures show that Goggle handle 47.3% of searches and Yahoo is next with 20.9 % with MSN at 13.6%.

What the search engines say

I will post in the next days extracts for webmasters or website owners for site ranking and positioning. These were all taken direct from the search engine sites but not all from one page. Where possible we have tried to include all relevant information in a fluid style but as you can imagine with search engines wanting to keep algorithms secret, they don’t give too much away. If you are brand new to website and Internet marketing then there are a few pearls of wisdom in there. If not then the rest of this blog offers more valuable information.

Thursday, July 1, 2010

Algorithms

Algorithms are not just terms you hear being used frequently in Star Trek, they are also the foundations for good search engines. In short and in search engine terms, an algorithm is a set of rules that governs each websites position in relationship to others containing the same keywords and phrase. For example, with two websites selling the same unique product, they both cannot be listed at position one so the search engine has to decide which is the more relevant website for the given search phrase and show it above the other in the search results. Just because one website shows above another for one phrase, it does not mean it always will for others. There are hundreds of factors which affect ranking in search engines many of which we can only guess at.

These algorithms are very closely guarded secrets because should they become known, then website owners could easily change or create websites to become the first site listed for certain keywords or phrases. This happened early on in the Internet boom and search engines would return results that were completely irrelevant to the searcher. The more relevant results a search engine returns then more people will use it and so the more work search engines put in to ensure highly focused results are returned to searchers.