NER_TAGS = [ {"name": "organization", "label": "Organization", "color": "#F1CBCB"}, {"name": "metric", "label": "Metric", "color": "#CAEACA"} ] NER_DATA = [ [ {"text": "At "}, {"text": "Santander", "tag": "organization"}, {"text": " our mission is to help people and businesses prosper. "}, {"text": "We are always looking for ways to help our customers understand their financial health "}, {"text": "and identify which products and services might help them achieve their monetary goals. "}, {"text": "Our data science team is continually challenging our machine learning algorithms, working with "}, {"text": "the global data science community to make sure we can more accurately identify new ways "}, {"text": "to solve our most common challenge, binary classification problems such as: "}, {"text": "is a customer satisfied? Will a customer buy this product? Can a customer pay this loan? "}, {"text": "In this challenge, we invite Kagglers to help us identify which customers will make "}, {"text": "a specific transaction in the future, irrespective of the amount of money transacted. "}, {"text": "The data provided for this competition has the same structure as the real data we have available "}, {"text": "to solve this problem."} ], [ {"text": "Many people struggle to get loans due to insufficient or non-existent credit histories. "}, {"text": "And, unfortunately, this population is often taken advantage of by untrustworthy lenders. "}, {"text": "Home Credit", "tag": "organization"}, {"text": " strives to broaden financial inclusion for the unbanked population by providing "}, {"text": "a positive and safe borrowing experience. "}, {"text": "In order to make sure this underserved population has a positive loan experience, "}, {"text": "Home Credit", "tag": "organization"}, {"text": " makes use of a variety of alternative data--including telco & transactional information"}, {"text": "--to predict their clients repayment abilities. While "}, {"text": "Home Credit", "tag": "organization"}, {"text": " is currently using various statistical and machine learning methods to make "}, {"text": "predictions, they're challenging Kagglers to help them unlock "}, {"text": "the full potential of their data. "}, {"text": "Doing so will ensure that clients capable of repayment are not rejected "}, {"text": "and that loans are given with a principal, maturity, and repayment calendar that will empower "}, {"text": "their clients to be successful."} ], [ {"text": "Imagine standing at the check-out counter at the grocery store with a long line behind you "}, {"text": "and the cashier not-so-quietly announces that your card has been declined. "}, {"text": "In this moment, you probably aren’t thinking about the data science that determined your fate. "}, {"text": "Embarrassed, and certain you have the funds to cover everything needed for an epic "}, {"text": "nacho party for 50 of your closest friends, you try your card again. "}, {"text": "Same result. As you step aside and allow the cashier to tend to the next customer, "}, {"text": "you receive a text message from your bank. "}, {"text": "'Press 1 if you really tried to spend $500 on cheddar cheese.' "}, {"text": "While perhaps cumbersome (and often embarrassing) in the moment, "}, {"text": "this fraud prevention system is actually saving consumers millions of dollars per year. "}, {"text": "Researchers from the "}, {"text": "IEEE Computational Intelligence Society (IEEE-CIS)", "tag": "organization"}, {"text": " want to improve this figure, while also improving the customer experience. With higher "}, {"text": "accuracy", "tag": "metric"}, {"text": " fraud detection, you can get on with your chips without the hassle. "}, {"text": "IEEE-CIS", "tag": "organization"}, {"text": " works across a variety of AI and machine learning areas, including deep neural networks, "}, {"text": "fuzzy systems, evolutionary computation, and swarm intelligence. "}, {"text": "Today they’re partnering with the world’s leading payment service company, "}, {"text": "Vesta Corporation", "tag": "organization"}, {"text": ", seeking the best solutions for fraud prevention industry, "}, {"text": "and now you are invited to join the challenge. "}, {"text": "In this competition, you’ll benchmark machine learning models on a challenging large-scale dataset. "}, {"text": "The data comes from "}, {"text": "Vesta", "tag": "organization"}, {"text": "'s real-world e-commerce transactions "}, {"text": "and contains a wide range of features from device type to product features. "}, {"text": "You also have the opportunity to create new features to improve your results. "}, {"text": "If successful, you’ll improve the efficacy of fraudulent transaction alerts for millions of people "}, {"text": "around the world, helping hundreds of thousands of businesses reduce their "}, {"text": "fraud loss", "tag": "metric"}, {"text": " and increase their "}, {"text": "revenue", "tag": "metric"}, {"text": ". And of course, you will save party people just like you the hassle of "}, {"text": "false positives", "tag": "metric"}, {"text": "."} ], [ {"text": "How much camping gear will one store sell each month in a year? "}, {"text": "To the uninitiated, calculating sales at this level may seem as difficult as predicting the weather. "}, {"text": "Both types of forecasting rely on science and historical data. "}, {"text": "While a wrong weather forecast may result in you carrying around an umbrella on a sunny day, "}, {"text": "inaccurate business forecasts could result in actual or opportunity losses. "}, {"text": "In this competition, in addition to traditional forecasting methods you’re also challenged to use "}, {"text": "machine learning to improve forecast "}, {"text": "accuracy", "tag": "metric"}, {"text": ". The Makridakis Open Forecasting Center (MOFC) at the "}, {"text": "University of Nicosia", "tag": "organization"}, {"text": " conducts cutting-edge forecasting research and provides business forecast training. "}, {"text": "It helps companies achieve accurate predictions, estimate the levels of uncertainty, "}, {"text": "avoiding costly mistakes, and apply best forecasting practices. "}, {"text": "The MOFC is well known for its Makridakis Competitions, the first of which ran in the 1980s. "}, {"text": "In this competition, the fifth iteration, you will use hierarchical sales data from Walmart, "}, {"text": "the world’s largest company by "}, {"text": "revenue", "tag": "metric"}, {"text": ", to forecast daily sales for the next 28 days. "}, {"text": "The data, covers stores in three US States (California, Texas, and Wisconsin) "}, {"text": "and includes item level, department, product categories, and store details. "}, {"text": "In addition, it has explanatory variables such as "}, {"text": "price, promotions, day of the week, and special events. "}, {"text": "Together, this robust dataset can be used to improve forecasting "}, {"text": "accuracy", "tag": "metric"}, {"text": ". If successful, your work will continue to advance the theory and practice of forecasting. "}, {"text": "The methods used can be applied in various business areas, such as setting up appropriate "}, {"text": "inventory or service levels. Through its business support and training, "}, {"text": "the MOFC will help distribute the tools and knowledge so others can achieve more accurate "}, {"text": "and better calibrated forecasts, reduce waste and be able to appreciate uncertainty and its risk "}, {"text": "implications."} ], [ {"text": "Nothing ruins the thrill of buying a brand new car more quickly than seeing your new insurance bill. "}, {"text": "The sting’s even more painful when you know you’re a good driver. "}, {"text": "It doesn’t seem fair that you have to pay so much if you’ve been cautious on the road for years. "}, {"text": "Porto Seguro, one of Brazil’s largest auto and homeowner insurance companies, completely agrees. "}, {"text": "Inaccuracies in car insurance company’s claim predictions raise the cost of insurance for "}, {"text": "good drivers and reduce the price for bad ones. "}, {"text": "In this competition, you’re challenged to build a model that predicts the probability that "}, {"text": "a driver will initiate an auto insurance claim in the next year. While "}, {"text": "Porto Seguro", "tag": "organization"}, {"text": " has used machine learning for the past 20 years, "}, {"text": "they’re looking to Kaggle’s machine learning community to explore new, more powerful methods. "}, {"text": "A more accurate prediction will allow them to further tailor their prices, and hopefully "}, {"text": "make auto insurance coverage more accessible to more drivers."} ] ]