Organic SEO Blog

231-922-9460 • Contact UsFree SEO Site Audit
Showing posts with label data mining. Show all posts
Showing posts with label data mining. Show all posts

Monday, August 02, 2010

The Web's New Gold Mine: Your Secrets

The Wall Street Journal



A Journal investigation finds that one of the fastest-growing businesses on the Internet is the business of spying on consumers. First in a series.

Hidden inside Ashley Hayes-Beaty's computer, a tiny file helps gather personal details about her, all to be put up for sale for a tenth of a penny.

The file consists of a single code— 4c812db292272995e5416a323e79bd37—that secretly identifies her as a 26-year-old female in Nashville, Tenn.

The code knows that her favorite movies include "The Princess Bride," "50 First Dates" and "10 Things I Hate About You." It knows she enjoys the "Sex and the City" series. It knows she browses entertainment news and likes to take quizzes.

"Well, I like to think I have some mystery left to me, but apparently not!" Ms. Hayes-Beaty said when told what that snippet of code reveals about her. "The profile is eerily correct."

Ms. Hayes-Beaty is being monitored by Lotame Solutions Inc., a New York company that uses sophisticated software called a "beacon" to capture what people are typing on a website—their comments on movies, say, or their interest in parenting and pregnancy. Lotame packages that data into profiles about individuals, without determining a person's name, and sells the profiles to companies seeking customers. Ms. Hayes-Beaty's tastes can be sold wholesale (a batch of movie lovers is $1 per thousand) or customized (26-year-old Southern fans of "50 First Dates").

"We can segment it all the way down to one person," says Eric Porres, Lotame's chief marketing officer.

One of the fastest-growing businesses on the Internet, a Wall Street Journal investigation has found, is the business of spying on Internet users.

The Journal conducted a comprehensive study that assesses and analyzes the broad array of cookies and other surveillance technology that companies are deploying on Internet users. It reveals that the tracking of consumers has grown both far more pervasive and far more intrusive than is realized by all but a handful of people in the vanguard of the industry.

• The study found that the nation's 50 top websites on average installed 64 pieces of tracking technology onto the computers of visitors, usually with no warning. A dozen sites each installed more than a hundred. The nonprofit Wikipedia installed none.

• Tracking technology is getting smarter and more intrusive. Monitoring used to be limited mainly to "cookie" files that record websites people visit. But the Journal found new tools that scan in real time what people are doing on a Web page, then instantly assess location, income, shopping interests and even medical conditions. Some tools surreptitiously re-spawn themselves even after users try to delete them.

• These profiles of individuals, constantly refreshed, are bought and sold on stock-market-like exchanges that have sprung up in the past 18 months.

The new technologies are transforming the Internet economy. Advertisers once primarily bought ads on specific Web pages—a car ad on a car site. Now, advertisers are paying a premium to follow people around the Internet, wherever they go, with highly specific marketing messages.

In between the Internet user and the advertiser, the Journal identified more than 100 middlemen—tracking companies, data brokers and advertising networks—competing to meet the growing demand for data on individual behavior and interests.

The data on Ms. Hayes-Beaty's film-watching habits, for instance, is being offered to advertisers on BlueKai Inc., one of the new data exchanges.

"It is a sea change in the way the industry works," says Omar Tawakol, CEO of BlueKai. "Advertisers want to buy access to people, not Web pages."

The Journal examined the 50 most popular U.S. websites, which account for about 40% of the Web pages viewed by Americans. (The Journal also tested its own site, WSJ.com.) It then analyzed the tracking files and programs these sites downloaded onto a test computer.

As a group, the top 50 sites placed 3,180 tracking files in total on the Journal's test computer. Nearly a third of these were innocuous, deployed to remember the password to a favorite site or tally most-popular articles.

But over two-thirds—2,224—were installed by 131 companies, many of which are in the business of tracking Web users to create rich databases of consumer profiles that can be sold.

The top venue for such technology, the Journal found, was IAC/InterActive Corp.'s Dictionary.com. A visit to the online dictionary site resulted in 234 files or programs being downloaded onto the Journal's test computer, 223 of which were from companies that track Web users.

The information that companies gather is anonymous, in the sense that Internet users are identified by a number assigned to their computer, not by a specific person's name. Lotame, for instance, says it doesn't know the name of users such as Ms. Hayes-Beaty—only their behavior and attributes, identified by code number. People who don't want to be tracked can remove themselves from Lotame's system.

And the industry says the data are used harmlessly. David Moore, chairman of 24/7 RealMedia Inc., an ad network owned by WPP PLC, says tracking gives Internet users better advertising.

"When an ad is targeted properly, it ceases to be an ad, it becomes important information," he says.

Tracking isn't new. But the technology is growing so powerful and ubiquitous that even some of America's biggest sites say they were unaware, until informed by the Journal, that they were installing intrusive files on visitors' computers.

The Journal found that Microsoft Corp.'s popular Web portal, MSN.com, planted a tracking file packed with data: It had a prediction of a surfer's age, ZIP Code and gender, plus a code containing estimates of income, marital status, presence of children and home ownership, according to the tracking company that created the file, Targus Information Corp.

Both Targus and Microsoft said they didn't know how the file got onto MSN.com, and added that the tool didn't contain "personally identifiable" information.

Tracking is done by tiny files and programs known as "cookies," "Flash cookies" and "beacons." They are placed on a computer when a user visits a website. U.S. courts have ruled that it is legal to deploy the simplest type, cookies, just as someone using a telephone might allow a friend to listen in on a conversation. Courts haven't ruled on the more complex trackers.

The most intrusive monitoring comes from what are known in the business as "third party" tracking files. They work like this: The first time a site is visited, it installs a tracking file, which assigns the computer a unique ID number. Later, when the user visits another site affiliated with the same tracking company, it can take note of where that user was before, and where he is now. This way, over time the company can build a robust profile.

One such ecosystem is Yahoo Inc.'s ad network, which collects fees by placing targeted advertisements on websites. Yahoo's network knows many things about recent high-school graduate Cate Reid. One is that she is a 13- to 18-year-old female interested in weight loss. Ms. Reid was able to determine this when a reporter showed her a little-known feature on Yahoo's website, the Ad Interest Manager, that displays some of the information Yahoo had collected about her.

Yahoo's take on Ms. Reid, who was 17 years old at the time, hit the mark: She was, in fact, worried that she may be 15 pounds too heavy for her 5-foot, 6-inch frame. She says she often does online research about weight loss.

"Every time I go on the Internet," she says, she sees weight-loss ads. "I'm self-conscious about my weight," says Ms. Reid, whose father asked that her hometown not be given. "I try not to think about it…. Then [the ads] make me start thinking about it."

Yahoo spokeswoman Amber Allman says Yahoo doesn't knowingly target weight-loss ads at people under 18, though it does target adults.

"It's likely this user received an untargeted ad," Ms. Allman says. It's also possible Ms. Reid saw ads targeted at her by other tracking companies.

Information about people's moment-to-moment thoughts and actions, as revealed by their online activity, can change hands quickly. Within seconds of visiting eBay.com or Expedia.com, information detailing a Web surfer's activity there is likely to be auctioned on the data exchange run by BlueKai, the Seattle startup.

Each day, BlueKai sells 50 million pieces of information like this about specific individuals' browsing habits, for as little as a tenth of a cent apiece. The auctions can happen instantly, as a website is visited.

Spokespeople for eBay Inc. and Expedia Inc. both say the profiles BlueKai sells are anonymous and the people aren't identified as visitors of their sites. BlueKai says its own website gives consumers an easy way to see what it monitors about them.

Tracking files get onto websites, and downloaded to a computer, in several ways. Often, companies simply pay sites to distribute their tracking files.

But tracking companies sometimes hide their files within free software offered to websites, or hide them within other tracking files or ads. When this happens, websites aren't always aware that they're installing the files on visitors' computers.

Often staffed by "quants," or math gurus with expertise in quantitative analysis, some tracking companies use probability algorithms to try to pair what they know about a person's online behavior with data from offline sources about household income, geography and education, among other things.

The goal is to make sophisticated assumptions in real time—plans for a summer vacation, the likelihood of repaying a loan—and sell those conclusions.

Some financial companies are starting to use this formula to show entirely different pages to visitors, based on assumptions about their income and education levels.

Life-insurance site AccuquoteLife.com, a unit of Byron Udell & Associates Inc., last month tested a system showing visitors it determined to be suburban, college-educated baby-boomers a default policy of $2 million to $3 million, says Accuquote executive Sean Cheyney. A rural, working-class senior citizen might see a default policy for $250,000, he says.

"We're driving people down different lanes of the highway," Mr. Cheyney says.

Consumer tracking is the foundation of an online advertising economy that racked up $23 billion in ad spending last year. Tracking activity is exploding. Researchers at AT&T Labs and Worcester Polytechnic Institute last fall found tracking technology on 80% of 1,000 popular sites, up from 40% of those sites in 2005.

The Journal found tracking files that collect sensitive health and financial data. On Encyclopaedia Britannica Inc.'s dictionary website Merriam-Webster.com, one tracking file from Healthline Networks Inc., an ad network, scans the page a user is viewing and targets ads related to what it sees there. So, for example, a person looking up depression-related words could see Healthline ads for depression treatments on that page—and on subsequent pages viewed on other sites.

Healthline says it doesn't let advertisers track users around the Internet who have viewed sensitive topics such as HIV/AIDS, sexually transmitted diseases, eating disorders and impotence. The company does let advertisers track people with bipolar disorder, overactive bladder and anxiety, according to its marketing materials.

Targeted ads can get personal. Last year, Julia Preston, a 32-year-old education-software designer in Austin, Texas, researched uterine disorders online. Soon after, she started noticing fertility ads on sites she visited. She now knows she doesn't have a disorder, but still gets the ads.

It's "unnerving," she says.

Tracking became possible in 1994 when the tiny text files called cookies were introduced in an early browser, Netscape Navigator. Their purpose was user convenience: remembering contents of Web shopping carts.

Back then, online advertising barely existed. The first banner ad appeared the same year. When online ads got rolling during the dot-com boom of the late 1990s, advertisers were buying ads based on proximity to content—shoe ads on fashion sites.

The dot-com bust triggered a power shift in online advertising, away from websites and toward advertisers. Advertisers began paying for ads only if someone clicked on them. Sites and ad networks began using cookies aggressively in hopes of showing ads to people most likely to click on them, thus getting paid.

Targeted ads command a premium. Last year, the average cost of a targeted ad was $4.12 per thousand viewers, compared with $1.98 per thousand viewers for an untargeted ad, according to an ad-industry-sponsored study in March.

The Journal examined three kinds of tracking technology—basic cookies as well as more powerful "Flash cookies" and bits of software code called "beacons."

More than half of the sites examined by the Journal installed 23 or more "third party" cookies. Dictionary.com installed the most, placing 159 third-party cookies.

Cookies are typically used by tracking companies to build lists of pages visited from a specific computer. A newer type of technology, beacons, can watch even more activity.

Beacons, also known as "Web bugs" and "pixels," are small pieces of software that run on a Web page. They can track what a user is doing on the page, including what is being typed or where the mouse is moving.

The majority of sites examined by the Journal placed at least seven beacons from outside companies. Dictionary.com had the most, 41, including several from companies that track health conditions and one that says it can target consumers by dozens of factors, including zip code and race.

Dictionary.com President Shravan Goli attributed the presence of so many tracking tools to the fact that the site was working with a large number of ad networks, each of which places its own cookies and beacons. After the Journal contacted the company, it cut the number of networks it uses and beefed up its privacy policy to more fully disclose its practices.

The widespread use of Adobe Systems Inc.'s Flash software to play videos online offers another opportunity to track people. Flash cookies originally were meant to remember users' preferences, such as volume settings for online videos.

But Flash cookies can also be used by data collectors to re-install regular cookies that a user has deleted. This can circumvent a user's attempt to avoid being tracked online. Adobe condemns the practice.

Most sites examined by the Journal installed no Flash cookies. Comcast.net installed 55.

That finding surprised the company, which said it was unaware of them. Comcast Corp. subsequently determined that it had used a piece of free software from a company called Clearspring Technologies Inc. to display a slideshow of celebrity photos on Comcast.net. The Flash cookies were installed on Comcast's site by that slideshow, according to Comcast.

Clearspring, based in McLean, Va., says the 55 Flash cookies were a mistake. The company says it no longer uses Flash cookies for tracking.

CEO Hooman Radfar says Clearspring provides software and services to websites at no charge. In exchange, Clearspring collects data on consumers. It plans eventually to sell the data it collects to advertisers, he says, so that site users can be shown "ads that don't suck." Comcast's data won't be used, Clearspring says.

Wittingly or not, people pay a price in reduced privacy for the information and services they receive online. Dictionary.com, the site with the most tracking files, is a case study.

The site's annual revenue, about $9 million in 2009 according to an SEC filing, means the site is too small to support an extensive ad-sales team. So it needs to rely on the national ad-placing networks, whose business model is built on tracking.

Dictionary.com executives say the trade-off is fair for their users, who get free access to its dictionary and thesaurus service.

"Whether it's one or 10 cookies, it doesn't have any impact on the customer experience, and we disclose we do it," says Dictionary.com spokesman Nicholas Graham. "So what's the beef?"

The problem, say some industry veterans, is that so much consumer data is now up for sale, and there are no legal limits on how that data can be used.

Until recently, targeting consumers by health or financial status was considered off-limits by many large Internet ad companies. Now, some aim to take targeting to a new level by tapping online social networks.

Media6Degrees Inc., whose technology was found on three sites by the Journal, is pitching banks to use its data to size up consumers based on their social connections. The idea is that the creditworthy tend to hang out with the creditworthy, and deadbeats with deadbeats.

"There are applications of this technology that can be very powerful," says Tom Phillips, CEO of Media6Degrees. "Who knows how far we'd take it?"

Wednesday, April 28, 2010

Analyists: Facebook Seeks More Revenue, Not Control of the Web
Computer World

 
Moves could boost Facebook's marketing revenue if they don't lead to another user revolt

Contrary to a lot of online buzz, analysts say that Facebook's moves this week don't indicate that the company is looking to take over the Web.

Facebook on Wednesday unveiled a bevy of development tools aimed at enabling the social networking phenom to extend its reach across a greater expanse of the Web.

The new tools let operators of other Web sites share user data with Facebook, providing the social networking firm with new online advertising opportunities.

Industry analysts say the moves could affect the future breadth and pervasiveness of social networking, extending it to many new areas of the Internet, including news and e-commerce sites.

"Facebook isn't going to take over the Web, as much as they might try," said Dan Olds, an analyst with The Gabriel Consulting Group. "What this does signify is that social networking has reached the point where it's very big business and the stakes are high. Facebook sees the opportunity to make a power play and get into the online marketing game in a much bigger way. At this point, we don't know if Facebook has a gold mine here or not."

Rob Enderle, an analyst with the Enderle Group, agrees that Facebook is likely looking to position itself as a leader in the effort to create a more social Web so that it can better capitalize on its revenue potential.

"This is Facebook making strategic moves to own its customers and [get] the revenue these customers generate," said Enderle. "It is likely the first of a number of steps [toward] pushing [social networking] beyond what it is now. In fact, we may stop calling it social networking as it becomes an integral part of the Web experience."

Stuart Williams, an analyst with Technology Business Research, said Facebook is in a race to stake a position in the expanding business before other companies create alternatives tools aimed at attracting Facebook users and their marketing potential.

"Vendors -- Google, Microsoft, Salesforce.com, IBM -- are bringing social platforms to market all over the place for both business and personal environments," said Williams. "Users are getting more sophisticated about wanting more control over their online personas as well as their personal data, comments, photos, video and history. Facebook is monetizing the online personas of their users, by doing them the favor of integrating the Facebook environment with other social platforms and Web services. "

A key question, said Olds, is whether users will see the Facebook SEO moves as a favor or a flagrant misuse of personal information. Facebook users have a history of speaking up -- loudly -- about perceived threats to their privacy.

"It's not that this type of information - my likes, my dislikes, what I'm talking about now -- isn't already public in Facebook," Olds said. "It is, but it's primarily confined to my universe of friends and admirers. Facebook is going to give marketers a lot more information to pinpoint and direct advertising to me."

He predicted that "there's going to be a firestorm of criticism over this, Users are very touchy about privacy and being used against their will. This has the potential to mash those user hot buttons with a hammer."

Williams added that the new features could be a prime example of Facebook's tendency to simultaneously please and irritate its users.

"Users get more and easier access to other platforms but Facebook and its partners get more value from the aggregate usage and shared information," he said. "I see the clouds of a civil war on the horizon between users and the platform vendors as users want more discrete control over their history, privacy and data, and the platform vendors who drive advertising and data mining businesses."

Sunday, November 15, 2009

New Mining Techniques Probe Deeper Into Consumer Data
Wall Street Journal

Behind the scenes of a recent online shopping trip, Blue Kai , a startup company that collects Internet user data, was tracking when a Web surfer browsed for electronics on eBay, searched for cruises and checked out snowboards. It also tracked when a Web surfer researched Chevrolet sport utility vehicles on auto site Autobytel and priced flights to Durham, N.C., at travel site Expedia.

After collecting that kind of information, Blue Kai groups Web visits into categories of consumers. It then immediately auctions off the data from some of the sites to marketers and Internet companies, which in turn use it for consumer research and ad personalization.

The Web companies make money for selling the data about visitors to their sites, and Blue Kai takes a cut. The advertiser gets its coveted targeting.

While the idea of target marketing has been around a long time, marketers until recently have had a hard time buying on enough Web sites to make the targeting truly effective.

Blue Kai, like data-mining firm eXelate Media and others, are striking deals with thousands of Web sites to collect and sell data on their visitors that will be used for consumer research or ad targeting. Marketers, in turn, are using the information they buy to make better choices when buying ad space. It is a process that's proving especially useful when buying through ad exchanges, which are new systems that allow advertisers to bid directly on the ad space available on a large group of Web sites.

Buying such data would let a hotel chain, for instance, show ads featuring discounts in North Carolina to a person who recently shopped for a flight to Durham—and not just on the travel site, but on any of a number of sites across the Web.

Some Web sites are hesitant to sell data about the consumers visiting them to outside firms. Historically, the only way a marketer could buy ads on a Web site was through striking a deal with that site directly. Now, buying ads based on the data about visitors, rather than the content published on the site, could drastically change how media companies do business.

The data is becoming the most important component for marketers and Web sites. It tells them who their audience is

Refining digital ad buying practices could bring some steam back to the market for online display ads—those with text and pictures that border a Web page—a market estimated at $20.8 billion in 2009, down from $23 billion in 2008, analysts say. U.S. online ad spending on targeted ads will reach $1.1 billion this year, up from $775 million in 2008, according to research firm eMarketer.

"These companies are adding tremendous value to the whole advertising ecosystem," says Ross Sandler, an Internet analyst with RBC Capital Markets.

Tapping such data gives marketers a way to buy ads according to specific groups of consumers who are likeliest to be interested in a given product, says Curt Hecht. Mr. Hecht is president of Vivaki Nerve Center, a unit of Publicis Groupe that buys hundreds of millions of dollars of online ad space a year for companies such as Procter & Gamble and Wal-Mart Stores.

For instance, a credit card company can buy ads targeted to small business owners it knows are in the market for a new card, instead of buying ads on business-related Web sites.

Neither Blue Kai nor eXelate discloses specific Web sites that they buy and sell data from, citing agreements with those Web sites. In their privacy policies, some Web sites reveal that they sell data to third parties, but often do not list the particular company. EBay says in its privacy policy that it works with Blue Kai but doesn't allow the company to collect any personal information about consumers.

Travel sites Expedia and Kayak say they both sell consumer data in Blue Kai's auction, noting that the information is anonymous and not tied to the specific Web site.

The data brokers have different formulas for collecting and selling information on millions of Internet users across thousands of Web sites, from top retail and travel sites to social networks. Blue Kai, a Seattle company launched in September 2008, regularly records information on more than 160 million unique U.S. monthly visitors shopping on retail, travel and auto sites across the Web.

"The data is becoming the most important component for marketers and Web sites. It tells them who their audience is," says Omar Tawakol, chief executive at Blue Kai.

Some lawmakers, concerned about Internet privacy, are preparing legislation to make more transparent Web sites' tactics for collecting information on their users. In an effort to fend off legislation, data brokers say, they abide by industry standards and do not collect any personally identifiable information and sensitive data, such as health information. They also tout efforts to make their business practices more transparent to consumers.

Both Blue Kai and eXelate, for instance, feature sections on their Web sites to show consumers what information the company tracks and giving consumers the option not to be tracked.

Not all Web sites where Blue Kai tracks information sell data to outsiders. Some use the information to personalize their sites for individual users or for their own advertising purposes.

Some publishers fear that their competitors could buy data about the consumers visiting their sites and use it to steal customers.

IAC/InterActive, for instance, is testing the sale of consumer data tied to its e-commerce site Pronto through eXelate. "If we sell that data, it allows another sales team to sell our audience and compete against us," says Greg Stevens, president of IAC Advertising. "But if it is worth millions and millions and millions of dollars, then hey, maybe the paradigm has turned upside down."

Saturday, April 04, 2009

Facebook Searching For Data Center Money
As Posted at Data Center News

BusinessWeek reports that the popular online site Facebook is scouting for up to $100 million more in financing to build out its data center infrastructure.

The company reportedly has about 275 million users, almost triple last year's number. According to research from Alexa Internet Inc., it is the fifth most popular Web site in the world. Supporting all those users -- and their uploaded photos and videos takes a lot of servers and storage devices.

The company has at least four data centers in the U.S:. three on the West Coast and one on the East Coast. According to the BusinessWeek story, Facebook added 75 million new users in the past three months, more than its entire U.S. audience. So presumably it will look to start building data centers overseas.

Monday, March 16, 2009

Google's Data Mining System - Packaged For The Masses
As Originally Posted to The New York Times

Cloudera is the quintessential Silicon Valley story.

Three of the top engineers from Google, Yahoo and Facebook have teamed up with an ex-Oracle executive to tackle the problems inherent in quickly analyzing big piles of data. On Monday, they’re revealing a commercial product based on the open source software Hadoop, which provides the analytical magic behind the world’s biggest Web sites. The team at Cloudera, based in Burlingame, Calif., think they can extend Web smarts to the business world, aiding companies in retail, insurance, bio-tech and oil and gas.

Hadoop is the open-source version of the file system and MapReduce technology developed by Google. Google has used such software to rewire its entire search index, making it possible for the company to run ever-faster searches on cheap servers and to ask questions of its vast data stores and receive coherent answers.

Rather than keeping data locked in a central database, Google spreads information across thousands of servers. Engineers can then send out requests to these servers via MapReduce and gain new insights into peoples’ searching behavior and the relationships between Web sites. This process makes for more consistent Google search engine optimization results, and best of all, MapReduce keeps these complicated jobs humming along even when computers fail because of its ability to maintain a cohesive picture of all the systems.

While Google has kept the deep details on this technology to itself, the company did publish a couple of papers describing some of the underlying principles. That gave Doug Cutting, formerly a software consultant and now a Yahoo engineer, enough information to create an open-source take on the code.

Yahoo has since invested millions of dollars improving Hadoop and uses the technology to figure out what users should see on its home page, based on their surfing habits, and what ads to display next to search results.

Other Web 2.0 users, including Microsoft, Facebook and Fox Interactive Media, have picked up Hadoop as well.

The founders of Cloudera argue that the analytical powers of Hadoop can benefit a whole new class of businesses. For example, they want to show biotech firms new ways of analyzing genome and protein data and give oil and gas firms new ways of digging through their reservoir data.

The pitch has proved attractive enough for Accel Partners to pump money into the start-up. Diane Greene, the co-founder of VMware; Marten Mickos, the former chief executive of MySQL; and Gideon Yu, the chief financial officer at Facebook, have invested in the company as well.

While Hadoop remains free, Cloudera plans to sell support and consulting services around the software.

The backgrounds of the executives point to the classic Silicon Valley nature of the story.

Just 26, Jeff Hammerbacher has already worked on Wall Street and at Facebook after graduating from Harvard, where he earned a degree in math. Christophe Bisciglia, 28, arrived at Google after raising and selling horses online during his high school years.

Amr Awadallah, 38, arrived in the United States from Egypt and secured a job at Yahoo, where he helped develop Hadoop. And Mike Olson, the 46-year-old chief executive, is a database executive that sold an open-source software maker called Sleepycat to Oracle in 2006.

While Google could make a fuss about intellectual property rights to the technology, the company has given Cloudera its blessing. Mr. Bisciglia discussed the company with Google’s chief executive, Eric Schmidt, last March.

“He agreed that this technology is not just for researchers, and it’s good for Google to make this pervasive,” Mr. Bisciglia said. “The more data people create, the more data Google can slurp up.”

A number of prominent computer scientists have hailed Hadoop as the right answer for an age when companies have moved from dealing with gigabytes of data to terabytes and now petabytes (one petabyte is equal to 1 million gigabytes or 1,000 terabytes). It’s one thing to store all of that information and another thing to be able to mine it in an efficient manner.

“It is a new reality that people have the ability to store and analyze terabytes and petabytes of data,” Bisciglia said. “Now they need the tools to process it.”