Events possess a rich structure that is important for intelligent information access systems (information retrieval, question answering, summarization, etc.). Without information about what happened, where, and to whom, temporal information about an event may not be very useful. In light … Continue reading An Overview of The Event Extraction Task in NLP
Search queries, passport scans, barcode scans, your online shopping history, your photos on Instagram, your tweets on twitter, voice messages, every day news articles, and more, and more… All of these contain a huge amount of data… Data generation is … Continue reading Sequential Clustering
In this post, we’re analyzing the results returned by the readability metric in our news feed. If you haven’t checked our post about “How to measure the readability of a text?” before, you can read about it here. How Are We Measuring the Readability? The main part of analyzing a metric is to know how does it work. In the current version, we’re depending on the AARIBase metric for measuring the readability. So, let’s have a look first on how does AARIBase work. Here’s the AARIBase formula: AARIBase = (3.28 × NOC) + (1.43 × ACW) + (1.24 × AWS) … Continue reading Analysis of the Readability Metric Results in Almeta News Feed
Readability is the ease with which a reader can understand a written text, which accordingly indicates how effectively the text will reach the target audience. The readability of text depends on its content (the complexity of its vocabulary and syntax), … Continue reading How to Measure Text Readability?
In a previous article, How to Detect Clickbait Headlines using NLP? We introduced the task of clickbait detection and explored how it can be modeled within the domain of machine learning and NLP. If you are not familiar with the concept of clickbait detection, make sure to review it before continuing. In this post, we’re building a classifier for clickbait detection in the news headlines depending on a pre-trained Arabic Word2Vec model and we’re validating this solution. If you are not familiar with the Word2Vec concept you can refer to this Wikipedia article for more information. News Headlines Representation In … Continue reading Clickbait Detection Using Word2Vec Representation
Clickbait is a type of hyperlink on a web page that has catchy or provocative headlines difficult for most users to resist, they tell you exactly what you’re about to see, with just enough of a tease at the end … Continue reading How to Detect Clickbait Headlines using NLP?
In this post, we are exploring how Google’s AutoML can help us in Almeta in developing automatic Arabic language processing tools. Before start if you are not familiar with the term AutoML you can refer to our previous post on this topic. Who is Google AutoML for? and When to Use It? The targeted audience by Google’s cloud autoML are people who have limited knowledge in machine learning. The main goal of this cloud service is to let the user build his own AI model that is tailored to his business needs, if the provided services by Google’s AI API … Continue reading Google’s AutoML Overview
When applying machine learning models, we’d usually do data pre-processing, feature engineering, feature extraction and, feature selection. After this, we’d select the best algorithm and tune our parameters in order to obtain the best results. AutoML is a series of … Continue reading Automated Machine Learning (AutoML)
The CI/CD pipeline is one of the best practices for DevOps teams to implement, for delivering code changes more frequently and reliably. This is our third article on Software Deployment we advise you to check the first 2 articles in … Continue reading Continuous Integration & Continuous Delivery: CI/CD
What is Docker Images? Docker is a platform for developers and system admins to develop, deploy, and run applications with containers. A Docker image we are going to talk about contains everything needed to run an application as a container. … Continue reading Docker Images
Software deployment is the final stage of every software project. When all the hard work you have put in over the course of time goes live to be used by the target audience. It includes all the process required for … Continue reading Software Deployment
News stories are created every day at many news agencies. Users may receive news streams from multiple sources. Browsing in large-scale information spaces without guidance is not effective. Suppose, for example, a person who has returned from a long vacation … Continue reading Event Detection in Media using NLP and AI