Yesterday we showed the top 10 WordPress categories worldwide. Wonder how we did it? Each month, DataSift receives approximately 15-25 million full-text WordPress blog posts into our platform. Recently, one of our customers asked what topics are discussed on blogs. Rather than simply say “everything”, we decided to take a deeper look into the content and substance of WordPress blogs. Using DataSift’s Salience Topics and Salience Entities augmentations, we…

continue reading »

We run free workshops every Thursday and I’d like to give you an example of what we cover. I’m going to use ‘Configuring Streams with CSDL in the API’ as an example, but don’t be put off if you’re a novice. We have workshops that cover the absolute fundamentals of the platform right up to advanced workshops covering complex destinations and filter optimization. The Workshop All workshops are delivered remotely by an experienced DataSift instructor….

continue reading »

Location: Forward, London. Date: August 27th – 29th. For more information and to register, go here now. What’s Node.js Knockout? Node.js Knockout is a global 48-hour coding contest featuring Node.js. The plan for London: for this year’s knockout, a bunch of London startups with an interest in Node.js, the realtime web, open-sourcery and general awesomeness decided that we’d all get together and host a location in London for people to hang out at and develop…

continue reading »

Taking inspiration from DataSift Invaders and some other projects I’ve seen that take technology out of the ether and bring it into the real world, e.g. the light painting WiFi project, I decided to take the DataSift augmented Twitter Firehose and experiment with making it more ‘tangible’. After digging around in my experiments/toy box I found the following bits: a TS-7553 embedded ARM SBC (single board computer), a Velleman K8055 USB interface board, some relays…

continue reading »

We will very soon be making some changes to our core API and I wanted to explain why we are making them. One of the first things we built for DataSift was the REST API – this was based upon the code from TweetMeme which had served us so well (it still serves tens of thousands of requests per second). We designed the API based upon a lot of assumptions back then and with hindsight they…

continue reading »

This weekend the world descends on South By Southwest in Austin, Texas for music, film and interactive talks and panels. But one thing ties these topics together… after parties! But how can you find the venues to go to? Where are the most influential people? Well, with Cluster we have answered that question for you. Using the power of DataSift to filter real-time data and augment it, we’ve combined that processing power with Foursquare place…

continue reading »

We have been working hard over the last few weeks to improve our filtering engine, both in efficiency and with new features. Firstly, let’s cover the differences between the last iteration of the CSDL and this new version. Changes to CONTAINS operator: We have a new, more efficient way of searching for keywords and/or phrases that now replaces the old implementations of the CONTAINS, CONTAINS_WORD & CONTAINS_PHRASE operators, and has merged them into a single CONTAINS…

continue reading »

A new year and we are super excited about what is coming in 2011. Big data and real-time are hot topics and we feel that DataSift is ready to take on the challenge of assisting our customers with the problems of dealing with finding the right data and soon how to analyse it. So what did we do in 2010? DataSift was built as a platform for consumers and businesses to sift through the real-time…

continue reading »

Happy new year! Big data and real-time are hot topics and we feel that MediaSift is ready to take on the challenge of assisting our customers with the problems of dealing with and analysing the right social media data. So what did we do in 2010? DataSift was built as a platform for consumers and businesses to sift through the real-time web to discover content. We launched the Alpha three months ago and within a…

continue reading »

We have updated our language detection service within DataSift, as we found that our previous version was unable to identify the language of interactions as reliably and efficiently as we had originally hoped it would. If you’re wondering where to find it, go to Create Stream or Edit Stream and it’s in the CSDL Language Help area at the bottom of the list. Thanks to the improved efficiency we have added support for ten more…

continue reading »