Rubyslava #65Štvrtok 30.3.2017 19:00
Using distant supervision for information extraction (Alexandra Adamec) We are developing a customer screening system able to find and extract relevant information about the subjects from the web. The system learns to detect relevant sentences from the training examples. However, we don't have enough human annotated training data. I will present you how to use a distant supervision to heuristically label a training corpus using an existing knowledge base and how it can help to find the relevant information in the unstructured text.
Identity Management in practice (Katarina Valalikova) As platform usages continue to expand, so too will the number of identities, which corresponds with a need to govern. There are many reasons. First, it enables people to perform their assigned duties. Second, it prevents unauthorized subjects from committing unwanted actions. Third, it converts manual, error-prone and inefficient identity processes to be automated, verified and compliant. In this talk I want to share my challenges and my experiences in the Identity Management (IDM) field. I’ll touch basic technologies behind the IDM scene.
MongoDB aggregation framework (Andrea Šteňová) MongoDB as one of the biggest NoSQL databases grows from version to version. In this presentation, we will look into its powerful aggregation framework, which is modeled on the concept of data processing pipelines. MongoDB introduces multiple types of indexes - such as geospatial index, text index or hash index, which can be used in simple queries as well as in aggregations. We will use all of this in example by sorting and filtering data using custom ranking method.
Rubyslava / PyVo is sponsored by JetBrains, Toptal and kiwi.com. Get a free yearly licence for a JetBrains editor of your choice. Win ebooks from Manning Publications. Drinks are sponsored by Toptal (www.toptal.com) and kiwi.com.