EnterpriseSearchCenter.com Home
  News   Features   White Papers   Research Reports   Web Events   Conferences  
 
RESOURCES FOR EVALUATING ENTERPRISE SEARCH TECHNOLOGIES
October 13, 2010

Table of Contents

SEO and the Business of Content
Lexalytics Unveils Sentiment Analysis of Short Form Content
Janya to use ISYS’s Document Filters for Embedded Text Extraction
In praise of information exploitation
Connecting users to knowledge
dtSearch Engine Enters Beta
Stealthy solution for voluntary compliance
Enhancing B2B e-commerce
DITA comes of age

SEO and the Business of Content

With the proliferation of search-oriented online content providers such as AOL, Yahoo!, Demand Media, and About.com, internet users are increasingly likely to find that most of the general searches they do return results from these SEO-oriented content creators. Whether this is a good or a bad thing from the user perspective remains to be seen—and opinions vary—but one thing is certain: Competing with these search-result-savvy content providers can present challenges for traditional publishing companies and enterprises that hope to increase their rank on search engines to attract visitors to their own sites.

Most general searches these days result in a high percentage of results that reflect domains such as About, eHow.com, Suite101.com, Examiner.com, etc.; all are examples of sites whose content is created by literally thousands of contributors hired to create content to appeal to the masses.

Why? It’s a combination of the sheer volume of information owned and controlled by these sites and the power of SEO (search engine optimization). The content game—once dominated by traditional publishers—is increasingly something organizations of all types are participating in. It is also something that traditional search companies such as Yahoo! and Google are moving into full force. Thus, search and content are becoming more and more intertwined.

Satisfying Searchers
Information search results were once almost solely the purview of name brand content companies. Today, however, top results (which may in fact be quite useful results) often come from cryptic sources, and information sources are not always immediately apparent. Demand Media, for instance, purchased Cracked.com in 2006, and its writers regularly provide content for the “Top 10” lists that appear prominently on Cracked’s homepage, with no mention of Demand Media to be found. Other content providers such as Associated Content provide a myriad of sources with content that may or may not be exclusive, meaning that web searchers have the potential to find the same information, from all of these aggregators, in multiple places. That said, previously unknown brands are becoming increasingly well-known, for example, About and eHow.

Brand name or not, sourced or not, the value of the content is in the eye of the consumer.

Giovanni Calabro, VP of user experience at Siteworx, notes that casual internet searchers are happy if the content that comes back to them eventually leads to the information they’re looking for. He doesn’t believe that they pay much attention to the source of that content. “If I’m looking for Lindsey Lohan, for example, if whatever comes back leads me to content about her, I’m happy. I don’t know if they’re going to sense that content as being generic or not. They’re looking for something so general, it doesn’t matter where they’re receiving it from.”

The good news, at least for now, is that the rise of companies such as Demand Media and eHow only reinforces the fact that there’s plenty of content demand and room for many content sources on the web. The trick, of course, is that everyone wants to sit at the top of that ever-valuable results list.

From General to Specific, and Everywhere in Between
Content providers such as Demand Media are, not surprisingly, attempting to feed the growing demand for content from websites hoping to draw eyeballs that attract advertisers. The company creates inexpensive content and sells it to companies that want content to make their sites more attractive via search engine results. In fact, article titles are literally driven by actual searches performed by content consumers, which Demand Media believes promotes the creation of content that aligns with consumer interests.

This turns the traditional search ranking model on its head, in that popular content used to rise in the results as users found it, mentioned it, linked to it, etc. Organizations approaching content in this way are concerned about drawing traffic to their sites to boost their business, whether the business model is focused on revenue generated through advertisers or through online sales. It appears that there’s ample opportunity for a multitude of approaches, according to industry experts—as long as optimizing for search is in some way a key part of the plan.

“Yes, AOL and others will have high overall page rank, and you’re likely not going to be able to compete with them there,”  according to Mike Jacobs, chief services officer of IMARKETING  LTD. However, he adds, “you don’t have to.” Search engines, he says, are looking for quality and relevant content. “Having content specific to a search is just as important as overall page rank to Google, and arguably more so.

“One site can’t be all things to all people, and Google knows this. AOL Seed and Demand [Media] will segment their content over multiple domains to a point but can’t have the laser-focus of an organization with a dedicated site,” Jacobs says. His advice: “Don’t try and be all things to all people, but [be] the expert in your area. Be the definitive source on the specific niche you’re targeting. Don’t go too wide, or you’ll just start to look like a lesser version of the goliaths you’re competing with.”

Companies hoping to increase their search rankings need to adhere to the basics, says Calabro, who stresses that SEO is still a very important part of any web strategy. However, he finds that SEO tends to bore most companies. “They say ‘we’ve done the SEO thing.’ But I can guarantee they haven’t,” he says. “If you’re not search engine optimized, you have a problem.”

Danielle Leitch, EVP of strategy for MoreVisibility, agrees. “SEO should always be the first focus,” she says. The competition “can be overcome if a company puts a good plan in place and then acts on it,” she says. “The challenge is really determining [the competition’s] strategy.” And, as others point out, businesses have always faced the challenge of competing with businesses that have access to more resources. The challenge is the same. It’s just the playing field that has changed.

Niche Marketing Generates Results
David Saries is an SEO expert with Dun & Bradstreet (D&B). It’s really no different than the traditional marketing environment that businesses have always competed in, says Saries. Just as small retail stores go up against big box retailers and often succeed, online marketers can effectively compete against the content providers that are pumping out massive amounts of content, he says. They can use the differentiators they have to drive traffic to their sites. Saries tells of an engineer he used to work with who now makes custom guitars and lives in San Jose, Calif. “He ships his guitars all around the country and perhaps the world,” says Saries. “It’s a very specific type of instrument that you’re not just going to walk into any music store and buy.”

Businesses will always have to compete with the Walmarts, the Walgreens, and the Barnes & Nobles around every corner, he says. Those that can successfully identify and leverage their unique differentiators that will appeal directly to a specific, target audience will be successful. These are the same differentiators, he notes, that they would have used in a Yellow Pages ad or a brochure or in talking to customers, but they now use them in an online context.

“To the extent that businesses can understand that and be more specific in their content and the pages they’re creating on their websites, they will be found,” he says. And there are a number of successful online businesses that have found that they can do just that by targeting a narrow niche and delivering specific content that appeals to that niche.
Bruce Kasanoff, co-founder of Draw the Dog, a website that was introduced in late 2009 that is based on a user-generated content model, is one example. “Never go head to head with a giant,” says Kasanoff. “Kick him in the knee, run between his legs, and hit him from behind.” On Drawthedog.com, unique cartoons are created on a daily basis. The cartoons are based on photos and stories about dogs that are contributed by visitors to the site. The site is growing fast, says Kasanoff, and it’s not solely because of SEO.

“We pay attention to SEO, but it’s not where we get our traffic,” he says. Traffic to the site has been built primarily by reaching out individually to dog rescue groups with win-win propositions, he says. “They pass the word about us, and we donate lots of art they can use to raise money for their nonprofit activities. These groups power our growth, and leveraging them requires building one-to-one relationships, something machines suck at.” In addition to these partnerships, word of mouth is generated through the contributions of the users. “Every cartoon we create is inspired by stories or photos dog owners send us,” says Kasanoff. “We make dogs famous, and every time we publish a new cartoon inspired by a dog, the owner emails, Facebooks, etc., everyone they know.”

For Kasanoff, success has been achieved through a narrow focus—the creation of relationships with site visitors who can actually become part of the site’s content and are eager to share it as well as partnerships with organizations that can help to spread the word about the site. Identifying a specific target market, clearly understanding the market and what needs and interests its members have, and leveraging partnerships are all basic marketing strategies that have worked for marketers for years. The good news is that these techniques can work online as well.

One misstep that some online marketers make is focusing on a market that is too broad. While some are interested in attracting a national following, others can and should be satisfied to draw from their local market area or from a narrow national market as Kasanoff has done. Once that narrow market has been defined, the next step becomes determining how that market is most likely to attempt to reach out to find you while online.

With this tactic, specificity is key. It’s the little things that matter, says Saries. Getting specific can generate results for businesses, he says. “AOL or Demand tend to have more generic articles or videos on some topic such as how to paddle a canoe. However, a local canoe rental business might specifically serve particular rivers or lakes that can allow them to stand out in geo-based searches or appear in local review sites such as Yelp,” he says.

Long Tail and Universal Search
Just because you can reach the masses online doesn’t mean that you need to, or should even want to. More is not always better.

Frank Dale, VP of Compendium Blogware, focuses on the “long tail search” in his work with clients. Long tail searches are simply those that are based on longer search phrases: “size 11 red men’s Nike running shoes +California” versus “running shoes,” for instance.

Saries agrees that long tail search holds promise for organizations trying to drive traffic with content as well as traditional content providers. Importantly, he notes, search engines are finding that users are using longer, more specific search phrases to find detailed answers. “By targeting the specific terms most related to a business’ products or services, they can compete for the most valuable customers,” he says.
Organizations “win” with long tail search phrases—winning meaning that their websites show up higher in the list of results returned by the search engine. Long tail search, says Dale, is not something that the very large content providers are really interested in. They’re looking more at two- to three-word phrases, so “long tail is a little bit too specific for them,” he says.

Alhan Keser, CMO for Blue Fountain Media in New York City, a web design, development, and marketing company, says that another key point about businesses that use long tail search is that they have a higher conversion rate—the percentage of visitors who actually make a purchase. “The reason for this is that someone looking up a product or service using very specific words is most probably at a more advanced stage of the buying cycle than someone searching for a generic term,” he says.

SEO will remain important for some time, and it is important for organizations to keep an eye on how the world of online search continues to change and what the future will bring in terms of both opportunities and challenges. The next big thing on the horizon right now is universal search, according to More Visibility’s Leitch. Search engines are going to be “relying heavily on universal search, which will provide users with a combination of results. When you type a word in a search box, you’ll see blogs, news, tweets, images, and videos.” This, she says, means that, more than ever, site owners will need to make sure that they’re managing all of their digital assets, not just text, to maximize their presence online. In other words, businesses will want to think about all of the various ways they can get information out on the web, beyond their website. This might include YouTube, paid advertising, social media, article placement, and news release placement.

Inbound links present another opportunity, says IMARKETING’s Jacobs, who notes that traditional content providers have an edge here. “Many other quality content sites won’t want to link to a monolith like AOL,” he says. “You want links from quality sites relevant to the topic area and, hopefully, referencing the topic in their anchor text.” In addition, he suggests, “encourage them to go deep, linking to deep site pages to really help push those pages to the top.”

All of this can seem a bit overwhelming, but “it’s not rocket science,” asserts D&B’s Saries, who recommends that businesses interested in expanding their online presence begin by taking advantage of the many sources of information that are readily available, often at no cost. “Google publishes information on SEO tactics, and they have a blog for webmasters that can help you understand some key metrics. They have a whole video series on YouTube that you can watch and tutorials on how to use a free tool like Google Analytics to better understand your market.”

Leitch advises website owners to decide which particular tactics are best for them and to not worry about gaining a presence across all channels. “You’re not going to be successful if you spread yourself too thin,” she says. “Having a plan of attack is important. Instead of trying to juggle one or more blogs, use press releases, post on YouTube, and use LinkedIn and Facebook; pick what’s likely to be most effective for your business and the community your company is trying to talk to,” she says.

A good starting place to develop that focus is to get a handle on how traffic is currently coming to their site, says Saries. “Start to see who’s visiting your site, how they’re getting there, when they come. Then, you can go further and further to say ‘they’re coming from Facebook or Twitter or Yahoo Search, or maybe from a local directory.’” This information can provide good direction in terms of where to focus additional efforts to draw traffic to the site, he says.

And, Jacobs adds, take advantage of the opportunity to learn from the behemoths. “Learn from the big boys, and don’t repeat their mistakes,” he says. “Let them spend the big dollars testing. See what they’re doing, see what works and what doesn’t, and modify your strategy accordingly.” In the end, though, he stresses focus on quality. “Better content will get you more readers, more links, and more activity, and the search engines will notice.”

Yes, the massive content providers such as AOL and Demand Media are online presences to be reckoned with. However, there remains an insatiable demand for a wide range of content—to inform complex decision making, to form relationships with organizations, to entertain, to answer immediate questions, and more. So there is room for content producers of all types. Yet this newfound focus on search-driven content production shines a spotlight on the importance of search in the content industry, reinforcing the increasing power of search in the content business.


Resources

Blue Fountain Media
www.bluefountainmedia.com
Compendium Blogware
www.compendium.com
Draw the Dog
www.drawthedog.com
Dun & Bradstreet
www.dnb.com
Google
www.google.com
IMARKETING LTD.
www.imarketingltd.com
More Visibility
www.morevisibility.com
Siteworx
www.siteworx.com

Back to Contents...

Lexalytics Unveils Sentiment Analysis of Short Form Content

Lexalytics, Inc., a software and services company specializing in text and sentiment analysis, announced the availability of enhanced reporting on the conversations occurring around, about, and between different accounts on Twitter based on the sentiment analysis of commonly used emoticons and acronyms.

With the use of emoticons, abbreviations, and confusing "social speak" grammar, micro-blog services such as Twitter present a difficult task for natural language processing systems. For acronyms, Lexalytics parsed thousands of tweets to get to hundreds of common acronyms and emoticons. The team then made decisions on whether each acronym (such as LOL for Laugh Out Loud) was sentiment-bearing, needed to be expanded, or should be treated as simply an interjection.

With emoticons, Lexalytics found that some are obviously positive (such as :D) or negative (:<) while others are considered more neutral. For the @ sign, Salience part-of-speech tags the @ tagged string as a "MENTION" which can be used for further reporting. In particular, @ tagged strings will return as people entities, with the associated sentiment, themes, etc.

Additionally, # sign (hashtags) are part-of-speech tagged as @hashtag. These do not report back as any sort of entity type. Hashtags are typically used as a lightweight "tag" for the content of the tweet. This information can be used by Salience for further processing as a tag.

(www.lexalytics.com)

Back to Contents...

Janya to use ISYS’s Document Filters for Embedded Text Extraction

ISYS Search Software, a developer of embedded search and federated access solutions, is teaming up with Janya to provide its ISYS Document Filters solution for use in the latter's Semantex text analytics platform. The augmented platform will give customers a complete solution for information integration, trend mining, and analysis of large volumes of unstructured content, according to the companies.

Janya, Inc. is a developer of solutions that transform unstructured and semi-structured data into cohyerent information for government agencies, commercial enterprises, and academic users.

(www.janyainc.com, www.isys-search.com)

Back to Contents...

In praise of information exploitation

 Vivisimo has released the newest version of its Velocity platform, which, the company says, introduces features designed to drive quantifiable revenue increases and productivity savings.

Although the new features were inspired by the trending adoption of Velocity as a sales enablement and customer service application, Velocity 8.0 carries with it new technologies applicable to all types of enterprises, both private and government.

The company explains that a key new component of Velocity 8.0 is IO Pro, which introduces new controls for business users, content owners and knowledge workers to optimize the way content is discovered by and presented to users. Business users can use IO Pro to promote and highlight the most relevant content, best bets and other important information to users. Content owners can configure relevant acronyms, synonyms and related terms to be displayed upon the context of the original inquiry, helping increase recall. With IO Pro, Velocity 8.0 greatly reduces the need for IT resource consumption by granting greater control over information delivery to those individuals closest and most knowledgeable about the needs of users, according to Vivisimo.

Recognizing organizations’ need for a single point of access, Vivisimo reports augmenting the value of SharePoint by integrating Velocity’s proprietary contextual information and organization features directly in the SharePoint interface. Velocity 8.0 allows users to navigate through all of their information assets such as e-mail, archives, file shares, CRM data and more directly in the SharePoint interface, driving a single point of access to all information.

Vivisimo says many information access solutions frequently waste 90 percent of the processing power on a single machine because they cannot leverage multiple cores. Instead of wasting that 90 percent of processing power, Velocity 8.0 can use all of that processing power and return results as much as 10 times faster, thus allowing more complex discovery tasks to be performed for the user in less time, utilizing overall fewer resources.

Further, says the company, as users form a query, they are automatically recommended related terms based on what is typed. Within a secure enterprise, users are only shown terms that are in content that they are authorized to view.

Additionally, the desktop search update enhances and improves the overall quality of Velocity's federated search connector to the Windows Desktop Search (WDS) application, including support for Windows 7.

Back to Contents...

Connecting users to knowledge

Text analytics provider TEMIS has made available its LuxidBar knowledge discovery service via free public download from its Web site. LuxidBar extends the benefits of content enrichment services to the user’s desktop and powers an enhanced, more productive navigation experience, the company claims.

LuxidBar was initially made available to TEMIS’ customers as a new gateway to the Luxid 5.2 Content Enrichment and Information Discovery Platform. The publicly available LuxidBar connects to a Luxid Content Enrichment Platform, hosted and maintained by TEMIS in the cloud. The platform performs a broad range of business and scientific entities extractions together with their semantic relationships.

LuxidBar is a lightweight component for Internet Explorer and Firefox that delivers four key features:

  • identifies and highlights key information within any Web page or document, enabling the user quickly to spot, read and navigate within the most relevant content;
  • inserts smart links on the fly within the text, enabling the user further to navigate and gather additional information in context;
  • displays information analytics dynamically to provide the user interactive graphical views of the document content; and
  • summarizes any webpage or document by displaying only key sentences explicitly referring to the user’s topic of interest.

LuxidBar is available for free download here.

Back to Contents...

dtSearch Engine Enters Beta

dtSearch Corp. announced the beta release of the dtSearch Engine, which makes available dtSearch's file format and data searching support for use in several internet, intranet, and commercial applications. The dtSearch Engine beta includes native 64-bit Visual Studio 2010 support as well as a .NET 4.0 SDK, which covers a sample application for the Microsoft Azure CLOUD platform, the dtSearch API, and the Spider API. Another notable feature of the beta involves performance enhancements for hierarchal sorting in cases involving millions of documented metadata tags or database records.

Aside from the dtSearch Engine, the beta also involves the rest of the dtSearch product line. Some of these products include dtSearch Web with Spider, dtSearch Publish, and dtSearch Desktop with Spider.

(www.dtsearch.com)

Back to Contents...

Stealthy solution for voluntary compliance

When ACS contracted with a government client to conduct a compliance audit, it recognized the need for intelligence gathering software that could scan the client’s entire infrastructure. The client wanted ACS, a Xerox company, to perform a risk assessment and to remediate any issues arising from the voluntary audit.

To carry out the project, ACS chose an enterprise search infrastructure solution from ISYS Search Software. ACS particularly liked the fact that the software would conduct the audit without disrupting worker productivity.

Paul McDonough of ACS/Xerox says, "What we were particularly impressed with during our initial tests was ISYS’ ability to scan multiple computers across multiple networks very effectively and efficiently. What’s more, it proposed a very stealthy solution, which enabled us to carry out our work behind the scenes without any user disruption."

According to ISYS, by leveraging its solution, ACS developed a network scanning, audit and risk assessment tool that proactively scans user workstations for potentially sensitive information protected by the Health Insurance Portability and Accountability Act (HIPAA) and by company policies.

McDonough says, "Through this system, we helped shield the customer from legal implications or costly penalties that are routinely levied by the government for HIPAA non-compliance. We see the ISYS system as a valuable solution that will help our customer perform annual voluntary compliance audits."

Concerning the project, ISYS reports that the implementation help to:

  • identify program data and files that need to be "locked down" or purged,
  • dynamically configure client-specific data search and algorithms,
  • cluster and define risk at the computer and/or organization level,
  • identify patterns of inter-departmental information sharing that might suggest potential projects for data consolidation, and
  • establish proactive voluntary compliance plans.

Back to Contents...

Enhancing B2B e-commerce

Bridgeline Digital has unveiled iAPPS V4.5, which expands B2B functionality within iAPPS Commerce. Further, says Bridgeline, with the implementation of ISYS Search Software, the entire iAPPS Product Suite now benefits from a far more advanced search solution than existed previously.

Additionally, enhancements to Order Workflow now enable configurable business rules around orders and make it possible for iAPPS to call out to ERP or other back-office systems to determine instantly if an order can be processed. iAPPS Commerce user experience for both buy-side and sell-side are given equal consideration.

Bridgeline Digital describes the iAPPS Product Suite as an SaaS solution that unifies content management, e-commerce, e-marketing and analytics capabilities--enabling users to swiftly enhance and optimize the value of their Web properties.

Back to Contents...

DITA comes of age

SDL has released Trisoft 2011 for DITA (Darwin Information Typing Architecture), the XML standard for technical writing. SDL says the new release of its Component Content Management software, Trisoft, extends SDL’s leadership position by offering support for the important new DITA 1.2 standard, currently in beta and on track for release by the Organization for the Advancement of Structured Information Standards (OASIS) technical committee in 2010.

With DITA 1.2 support, SDL Trisoft can now be used out of the box with the DITA Learning and Training specialization, the new task model and the machine-industry specialization. Users can also take advantage of powerful new ways of managing content variations and achieving reuse with DITA 1.2’s new capabilities of "keyref", "conkeyref", "conref push" and "conref ranges".

In addition to supporting the DITA 1.2 standard, Trisoft 2011 offers new authoring and publishing capabilities designed to significantly enhance the user experience. Users of the SDL Trisoft Web client will experience an entirely new look and feel, making writers’ online experience more enjoyable and efficient, claims SDL. Authors will also benefit from a new search architecture that gives them the ability to search for information faster and with more granular precision. The new search includes the ability to search and find content inside XML elements and attributes, variables and conditions and to identify precisely the context in which content resides.

SDL further believes lead authors will appreciate enhanced capabilities offered in the Publication Manager, engineered to save time developing content and releasing product publications. The new capabilities include the ability to save a new version of an open publication, to check in/out from the Browse Repository dialog, to find most recently used items, to save output formats in the next version of a publication, to see an InContext Preview of Content, to add additional fields to the Baseline tab and to autocomplete items that are added when building a publication.

Back to Contents...
 
[Newsletters] [Home]

Problems with this site? Please contact the webmaster. | About ITI | Privacy Policy