Article
Latest Search Engine Spam Techniques
Page: 1 2
Links Inside No Script Tags
One top publishing site I recently discovered secretly interlinked its sites using the no script tag. Although I can't name the site, I can show you how the technique worked.
Used legitimately, the no script tag provides spiderable links when a user's browser (or a search engine robot), has its JavaScript turned off. Anything that appears inside the no script tags is not visible on the Web page itself.
To be used authentically, the no script tag must contain links that replicate those used within JavaScript code in the actual page.
But in this case, the links went to sites that strategically collected PageRank. They were basically hidden, acting as underground network of links to support the publisher's rankings. This code appeared in almost all of the site's many domains -- and perhaps exists in the Websites of others, who may not even know it's there! Some of the pages only used a closing </NO SCRIPT) tag, which could also confuse search engines.
<SCRIPT LANGUAGE="javascript" SRC="http://www.spammersite1.com/counter.asp?ID=2667&NoLink=1" TYPE="text/javascript"></SCRIPT>
<NOSCRIPT><a href="http://www.spammersite3.com">new homes</a> <a href="http://www.spammersite3.com/popularkeywords.asp?
Keyword=concrete+design">concrete
design</a> <a href="http://www. spammersite3.com/popularkeywords.asp?
Keyword=precast">precast</a>
<a href="http://www. spammersite3.com/popularkeywords.asp?
Keyword=mantel">mantel</a>
<a href="http://www.spammersite4.net/">home decorating</a>
<a href="http://www.spammersite5.biz/">home
improvement world</a> <a href="http://www.spammersite6.com">luxury homes</a>
</NOSCRIPT>
The complex code above was even loaded with keywords (using asp code). These keywords signaled to the Web server at the spam target site the type of dynamically generated page that should be served in response to the query. These tactics are not approved of if they're done deliberately to manipulate search rankings. Visitors to this site were totally oblivious to the devious intent of the site owner, and search engines were fooled as well.
Non-Robot JavaScript Detectable Redirects
The use of mouseover code like that shown below is quietly spreading across the Web:
<body onMouseOver="eval(unescape('%6C%6F%63%61%74%69%6F%
6E%2E%686F%70%69%63%62%61%74%6F%6E%73%2E%6E%65%74%2F%27%3B'));"
There have been rumors that Google is taking action against this tactic. In the cases I discovered, the JavaScript code automatically redirected the visitor to another page, but only upon the cursor being moved over the page itself. It was almost impossible for the user to avoid setting this code off.
I found the code on a site ranked number one on Google for its primary keyword phrase. As search engine robots don't use a mouse, they're blind to the spamming activity. In this case, the tactic was combined with a server side redirect to another page, which was relevant only in some cases. The purpose of the redirect may have been part of bigger ploy to support another ranking strategy.
Dynamic Real Time Page Generation
It is possible for a Web server to produce and serve different, optimized pages according to the referrer of any page request.
In theory, there is nothing wrong with serving a page that's customized to the circumstance in which it was requested -- indeed, many ad campaigns serve up different ads based on the type of banner that was clicked. Customized ads are seen as being far more effective and useful for users.
With dynamic page spam, however, the site is loaded with hundreds of these phantom pages (dynamic urls) that act as affiliate links to some other site. Search engines don't want affiliate links. In the case I found, all the links were credited to the site's backlink count.
I don't think this is what search engines had in mind when they began to spider dynamic urls -- they certainly don't want to allow affiliate link spam.
Here's what the links typically look like:
www.spammersite7.com/perl/click.pl?id=2068&a=i
When the robot follows the links, it receives a meta refesh that links to an error page called redirect.cfm. This page has links back to the home page, which are credited to the site's backlink count.
<meta http-equiv="REFRESH" content="2; URL=http:// www.spammersite7.com/redirect.cfm?url=spammersite7.com">
</head>
<body onLoad="document.form1.submit();" >
Please Wait...
<form name="form1" method="post" action="redirect.cfm">
<input type="hidden" name="url" value=" spammersite7.com ">
</form>
DHTML Layering and Hidden Text
Using DHTML layering, spammers can hide layers of keywords beneath graphics. One layer covers the other visually, yet the text hidden on the lowers layer is readable by the search engine robot -- another highly illegal technique.
HTML Hidden Table Cells
The combined powers of CSS and html and the loose dtd <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd"> allow the unscrupulous site owner to hide the content of table cells loaded with keywords and heading tags.
CSS permits the flexible positioning of Web page elements; it's a flexible coding language that search engines do not fully understand at present. In short, the search engine doesn't really know what's being displayed. This trickery can be specified in a separate CSS sheet (.css file), which a search engine may or may not index. This CSS style sheet file, however, does affect the display of content on the page.
In this example, the CSS affects the display of the body of the Web page, which is set to 97%:
{font-family: Arial, Helvetica, sans-serif; width:97%; font-size: 10pt; overflow: hidden; color: #000000; margin: 0px;}.
Within the regular code, .gif files can be placed in the page at a width of 150%, ensuring that part of the page is not seen. That extra 50% provides plenty of room for keywords stuffed into <h1> tags.
Enormous Machine-Generated Websites
Those Webmasters who are not adept to html, dhtml or css tricks may try something simpler. When there's not enough content to go around, they often try to stretch a minimal amount of content across thousands of pages. The pages are built with templates and the sentences within them are basically shuffled from one page to the next. Unique title tags are plugged into each page that's generated.
This technique basically sees the same page repeated hundreds to thousands of times. It can even be done using a computer program that systematically stuffs the text sentences, paragraphs and headings, including keywords, into pages.
This technique is most often used with ecommerce sites that have a limited range of products for sale. Often, the products are simply re-organized, or shuffled to create another page that appears to be unique. It's actually the same selection of products presented countless different ways.
Link Spam
To maximize Pagerank distribution throughout a Website, some spammers will fill a page with links to the point where it is just a links page, and every page links to every other page.
Why do this? Well, by maximizing the number of links, the spammer more equally spreads PageRank throughout his or her site. When links from all those pages point to a single page on a keyword topic, the site can gain higher rankings for that phrase.
Link exchanges are also considered link spam. The links are fabricated -- not a real reflection of personal choice. Most link exchanges are now being filtered out of search results; however, some links in link exchanges are still being recognized.
This system allows the server to give the robot different content than that which is delivered to human visitors. And that means the search engine could be deceived.
Invisible Text
Invisible text is invisible because the font color is the same as the color of the background or background image.
In one example I saw, a site used the font color "snow" to make the text white on a white background. The author also used this font tag in a way that caused it to overlap another tag, thereby confusing the search engine robot further.
The example below uses a black color .gif as the background to hide black text. It also has a dhtml layer directly above it, to further hide the text.
<body bgcolor="#000000">
<table width="14%" border="0" cellpadding="6" cellspacing="0" bgcolor="#FFFFFF">
<tr>
<td background="black.gif"><font color="#000000">invisible text</font></td>
</tr>
</table>
<div id="Layer1" style="position:absolute; width:200px; height:115px; z-index:1; left: 5px; top: 8px; background-image: url(black.gif); layer-background-image: url(black.gif); border: 1px none #000000;"></div>
</body>
A robot can't detect whether text in a dhtml layer is the same as the background used in a layer below it. The layer can even be set off-screen so it is never visible to a person.
Link Farms
Link farms are still prevalent on the Web, even though search engines can detect their presence through link pattern recognition. Since link spamming is being done at a macro level, so search engines must be able to view a large sophisticated network of links and delete those that are machine-generated and not true, human-chosen links.
The hilltop algorithm is one filter that minimizes the advantage gained by hundreds of useless links.
Spamming Penalties
Each search engine has its own distinct prohibitions and related penalties. Each penalty is a response to the degree of threat the search engine that a given spamming technique represents.
Spammers may receive demerits, through which the ranking of their sites on a particular phrase might drop significantly. Alternatively, a zero PageRank penalty may be awarded to a particular page, or whole sites may be banned if the search engine so chooses.
Now that these techniques are widely known, I strongly advise you not to try them. The search engine engineers may be embarrassed that these tricks really do work, and will move swiftly to take action against spammers.
Oh What a Wicked Web We Weave
What's the final word on search engine spam? Well, that's between you and the search engines. Now that you know some of the popular spamming techniques in use, you'll at least know how to avoid using them. Once word gets out, the search engines will ban their usage.
To avoid the problems created by spamming, choose an SEO that can achieve legitimate results. Don't ask for top ten guarantees when guarantees are considered wrong by the search engines. Hire an SEO that offers the full package of content creation and development. You'll get your money's worth, the search engines will get rich, useful content, and your site will attract targeted, qualified users.