
In the rush to embrace the Big Data phenomenon, many organizations start just collecting data. These data collection programs vacuum up vast swaths of information and install Hadoop or some Apache project just to try to make sense of what is there. A few months go by and suddenly everyone in the organization is amazed by this data—after all, you can create reports by any conceivable dimension and just watch a bunch of data come…


Ensuring Data Quality

Dary Hsu 25th August 2015 0 Comments

In this era of big – no, Big – Data, we find ourselves surrounded by all sorts of data in all sorts of shapes and sizes. But as the old adage goes, quantity is not necessarily indicative of quality, and if the point of your Big Data program is to tease out insights and drive decisions based on hard data and analytics, it is important to pay attention to the quality of data that you…


A good business is all about people, process, and product, and above all I think people are the most important. And as we advance into a new age where data is king and we can make increasingly good predictions and have a significantly better understanding of our customers, we need to have the right data people on our team. The core of a Big Data team is the data scientist: someone who knows how to…

2015-07-23

Last week while in New York, DataSift announced some great new products, including VEDO Intent (active machine learning for categorization and classification) and our new PYLON Partner Portal. While we were out there we also held our first dev|datasift event in New York, a lunch and learn for folks interested in Facebook topic data, machine learning, or social data – and we packed the house! If you missed it, much of the content we discussed it…


A special thank you to everyone who made it out, and to the DataSift team members who came to support our first SF dev|datasift meetup! For those who missed it, this week we had our first San Francisco meetup, at the awesome Intersection for the Arts. Our next meetup will be in New York on June 23, and we hope some of you can make it! So what’d you miss? We had tasty food (grilled…


You have mass quantities of all sorts of data imaginable. And yet, to tease any insight out of that, you have to learn the equivalent of SQL or hire a bunch of data scientists to write queries for you full time. What is the alternative to this? It is called natural language processing, and it is the future of Big Data. Specifically, Big Data will become Big Democratized Data when computers and servers and services can understand…


Privacy, when it comes to Big Data, is a two-way street. We must first promise to treat data carefully and protect it, but we also must plan to act when privacy is threatened or compromised. Both strategies are important. Here are some thoughts on tactics on each side of the ball. Offense: Part of protecting data privacy is not being reactive. Treating data well in the first place goes a long way toward protecting…


It can sometimes feel like chief data officers are being pulled in about ninety different directions, mainly because data is something that is so valuable as a business asset. You will get requests often from a variety of different departments within your organization and it can be tempting to try to service them all, but these requests deserve a thorough vetting—each and every one of them. Why? Because the data you keep represents people. These…


The privacy policy is the core element of the overall contract between your company and the prospects and customers that share their data with you. Here are three common mistakes companies make when building or examining their privacy policies. Reusing another company’s privacy policy as your own. A number of companies, especially smaller and newer businesses, think of a privacy policy as simply a checklist item on the project completion timeline for getting a website…


If there is information about you on the Internet, do you have the right to get it deleted? If you did something a decade or two ago that is still showing up in searches when people Google you, do you have the right to ask Google to stop showing it? That’s the central question of the “right to be forgotten.” The European Union (EU) has always been well ahead of other jurisdictions on the road…
