Today we focus on text mining techniques, more specifically on topic modeling. The main goal is to find clusters of articles based on their content. These articles were scraped from the websites www.sme.sk, www.pravda.sk and www.dennikn.sk. As you might have already noticed, the tricky part is that we will be dealing with articles in Slovak, which makes our job a bit harder. We will compare two techniques and see whether they can deliver good-quality results.
In this post we will use modern Natural Language Processing techniques to find similar posts in a Facebook group. You have probably been in a situation where you want to post something in a Facebook group but are not sure whether an almost identical post already exists, perhaps just hiding on the next page.
In the last part of the tutorial, we managed to interactively draw location points on the zoomed-in country and added a tooltip linking directly to our content. One thing we are still missing is an overview of what content is available for which countries, without having to manually click through all the available points.
Gut feeling used to be the biggest asset of successful businessmen. Nowadays, intuition still plays an important role, but with all the available knowledge and technologies, there has been a significant shift. One of the most important sources of competitive advantage these days is data. Big Data is hyped and undoubtedly a bandwagon to jump on. But how?
For this blog post we decided to jump on the Pokémon GO hype and add a bit of science to the craze. Our goal is to give you the optimal portfolio of Pokémon to train, so you can be as effective as possible against a wide variety of opponents. As each Pokémon has its strengths and weaknesses, we created clusters of Pokémon with similar characteristics and picked out the few that allow a player to compete against as many different enemies as possible.
For our latest blog post, we took discussion comments from the Slovak sports community website sport.sk and fed them to our deep learning model. We wanted to see whether we could teach a model to generate such comments automatically from all the past comments we had, and whether it would learn the structures and nuances used in them. Enjoy! (The article is in both Slovak and English.)
May was an eventful month. In the first two weeks, we were very excited to launch our data science bootcamp, Basecamp! Now that the application process is open, all prospects can finally apply and set off with us on an adventurous journey to becoming qualified data scientists. What an exciting time! But that was just the beginning. In the second half of the month, we visited three European capitals to attend three conferences: Big Data Universe in Budapest, Big Data & Data Science Conference in Paris, and, last but not least, Pioneers Festival in Vienna.