Archive for May, 2008

Pandia Weekend Wrap-up May 25

Sunday, May 25th, 2008

Here are some of the search engine headlines we have found interesting lately:

Original post by Per and Susanne Koch

Original post by WiadomoÅ

The end of Live Booksearch and Search Academic

Sunday, May 25th, 2008

Woman with booksThe Live Search Blog reports that Microsoft is abandoning its Live Search Books and Live Search Academic projects. We argue that this might be a sign of a greater cultural weakness of Microsoft.

Microsoft will keep the current Live Search Books and Live Search Academics databases, and the data will be included in regular web search results.

The end of the digitization project

The databases will not be updated though, as Microsoft is also closing down its “digitization initiatives,” including the library scanning and their in-copyright book programs.

This means that Microsoft is, in essence, giving up fighting Google in an area that many has considered an important part of the future of search.

Google is clearly planning to become the one stop portal for all kinds of information, including the printed word, and book scanning and indexing is an important part of that strategy.

Too strong focus on short term profits

Our guess is that Microsoft has come the the conclusion that such an adventure would not be profitable, and that the main focus for search should be the regular web searcher, and not experts, academics and researchers.

The Live blog says:

Given the evolution of the Web and our strategy, we believe the next generation of search is about the development of an underlying, sustainable business model for the search engine, consumer, and content partner. For example, this past Wednesday we announced our strategy to focus on verticals with high commercial intent, such as travel, and offer users cash back on their purchases from our advertisers.

We are not so sure if this is a sensible approach. The scholars and experts belongs to the main opinion makers on the Web, and they were the ones that made Google a success by promoting it for free in their circles.

Moreover, they have a lot of purchasing power and there are bound to be more of them as the educational levels are raising all over the world.

Finally, there will have to be changes in international copyright laws, making written content more easily available, and when that happens “paper-based” high quality content will be a very attractive source of information.

Not all of Microsoft’s work in this area is wasted. They intend to provide publishers with digital copies of their scanned books. They are also removing their contractual restrictions placed on the digitized library content and making the scanning equipment available to their digitization partners and libraries.

The two search services include some 80 million articles and 750,000 scanned books.

The Swedish blog Internetbrus argues that this is a serious setback for academic search, as Live’s product has been much better than Google Scholar.

Is there something wrong with the Microsoft business culture?

We would actually like Microsoft to succeed as an online search property. Search needs alternatives to Google, also for Google’s sake, so that they do not become complacent and less innovative.

It seem, however, that the traditional economists and managers is in control of Microsoft, making it hard to develop grand visions of the future, and that this stops Microsoft from becoming the innovative alternative to Google we all need.

Microsoft, of all companies, has the money needed to be daring in this field. Instead they seem backward looking and timid.

This may partly explain Yahoo’s resistance to becoming part of the Microsoft empire. In spite of all its cash, Microsoft may actually end up suffocating what’s left of Yahoo’s innovative spirit.

Page Traffic
Ready To Get New Customers?

Original post by Per and Susanne Koch

Original post by WiadomoÅ

Teens produce ever more content online

Thursday, May 22nd, 2008

Social media — blogging, social networking and creation of digital material — is central to many teenager’s lives. 93% of US teens use the internet and 39% of online teens share their own creations online: Photos, stories, videos and more. This number is up from 33% in 2004.

These numbers are from the report “Teens and Social Media” from the Pew Internet and American Life Project. And the report holds other numbers of interest to many of Pandia’s readers: Educators, marketers, librarians.

Blogging is up, mostly due to the girls

28% of online teens have created their own blog (up from 19% in 2004) and 27% maintain their own web page.

Girls lead the charge in the teen blogosphere: 35% of teenage girls blog, compared with 20% of the boys. Most of the growth in teenage blogging from 2004 to 2007 is attributable to the increased activity of girls.

Real social networking

About 55% of American online teens have profiles on a social network site. But they do more than just maintain their profile: 70% read the blogs of others and 76% have commented on a friend’s blog post or social network item.

While the boys are not the most avid bloggers, they are more likely to upload videos to video sharing sites. Boys are twice as likely to upload videos as girls.

Digital images, both stills and videos, play an important part in teens’ social networking. 89% say that they receive comments on the items they post online. And this practice isn’t limited to images and videos. 76% say they comment on the blog posts of others.

These comments often turn into conversations, making this more than the online exhibitionism that teens are often accused of. This is real social networking where the contributors learn to know each other and learn from each other.

This is emphasized by the fact that few teens who upload photos share them without restrictions. Far more cautious than adults, only 21% of teens say they never restrict access to the images they upload. The number for adults is twice that: 42%.

What does it mean?

What can we learn from these numbers? For one thing, it seems the rate of contribution of content online is set to rise. In 2006 Jakob Nielsen found that in most online communities, 90% of users are lurkers who never contribute, 9% of users contribute a little, and 1% of users account for almost all the action. These numbers certainly don’t hold true for teenagers, who seem to be set to conquer the web as they grow up.

The numbers from Pew also indicate that teens are serious about their online presence, making acquaintances, building relationships and taking care both to contribute and to protect their contributions. Even though much of their online activity is for fun, they don’t take it lightly.

Want to know more about teens online? Have a look at our article about Tomorrow’s web customers.

Page Traffic
Ready To Get New Customers?

Original post by Per and Susanne Koch

Original post by WiadomoÅ

Careerjet job search engine

Tuesday, May 20th, 2008

Careerjet claims to be the most globally comprehensive job search engine on the web. It is comprised of over 50 different country-specific job search websites. Careerjet is translated into 20 languages, with both the job listings and interface localized to the language and location of the job seeker audience.

Careerjet uses automated multi-lingual semantic analysis technology to identify and classify online job offerings.

Over 58,000 websites are scanned daily for sites featuring job openings. Job seekers from all corners off the world will find open positions advertised on Careerjet for these regions: Europe, North America, Central America, Caribbean, South America, Africa, Middle East, Asia and the Pacific.

How accessible is all this information? Is the huge amount of information a boon or is it just too much information?

Is it any good?

Pandia is located in Norway, so I decided to have a look at Careerjet.no.

We found 5630 jobs in Oslo (Norway’s capital). That’s a little less than the leading provider of online job ads in Norway, Finn.no, which lists 6631 jobs.

You can search for jobs on national level, county level, municipality or city level and even find jobs in specific city districts.

When you get to the level of city district, it becomes obvious that the source information is inadequate. Many of the job listings do not provide data about city district, so we find few jobs this way.

You can also browse a large selection of job categories. Here a flaw in Careerjet’s analysis becomes obvious: Along with jobs in Norway, we also find job listings from neighboring Sweden.

There are options to sort your results to show full time or part time, permanent or temporary positions. You can also sort by relevance, date or salary, which are options that are not available at Finn.no.

I also had a look at Careerjet.com and the job market for Waschington DC. I found 47577 jobs. It was easier to browse the jobs here than on Careerjet.no because on the Norwegian site some jobs were listed in English, some in Norwegian.

Browsing Careerjet.com for US jobs, I wondered just how comprehensive the listings were: For New York State only these cities were listed: Buffalo, New York City, Rochester, Syracuse and Yonkers. For the Norwegian counties, every municipality was listed.

The next time I’m looking for a job, will I use Careerjet? Maybe.

If I was looking for a job abroad I definitely would. Careerjet offers some very convenient tools for people on the global job market.

Outsource SEO/PPC - $11 - $15 per hour
Outsource SEO/PPC
$11 - $15 per hour

Original post by Per and Susanne Koch

Original post by WiadomoÅ

Firefox addon summarizes Google search results

Monday, May 19th, 2008

Sensebot is a semantic search engine still in closed beta (we reviewed it last summer). The main boon of Sensebot is that it summarizes Google search results through text mining. But even if you’re not in on the beta testing, you can still profit from the technology by installing the Sensebot Summary plug-in for Firefox.

The extension integrates with the Google results page.

You do your Google searches as usual, but at the bottom of the results page a new button is added: “Summary”. When you click it, a text summary of the results is generated and displayed on the same page.

Save time on informational queries

According to Dmitri Soubbotin of Sensebot, a text summary is most useful for informational queries, rather than navigational or transactional. Here’s a sample query for “Vladimir Nabokov” to give you an idea.

Through text mining of the results, Sensebot Summary creates a digest on the topic of the query. The idea is to save users time by giving them a quick answer, saving them the trouble of drilling into each individual result page.

The Firefox extension is limited to summarizing one page of search results. Unlike the extension, the SenseBot search engine can parse up to 10 pages and generate a summary.

“It is quite an eye-opener to see a valuable piece of information brought from the 4th or 5th page of results,” says Subbotin.

The mechanics of Sensebot

As for the mechanics of SenseBot, Subbotin explains that it performs text mining of every source and “understands” its key concepts. Then it performs a multi-document summarization of all sources together, attempting to present the content as coherently as possible.

The query is also taken into account, and extra weight is given to the concepts related to it. So there is really a lot of work going on in the background. In the full SenseBot version the key concepts are shown in a “Semantic Cloud” above the generated summary.

Get the Sensebot Summary plug-in for Firefox.

The SEO dream team!
The SEO dream team!

Original post by Per and Susanne Koch

Original post by WiadomoÅ