Fraud patterns keep changing frequenlty and it is important to capture current fraud patterns using most recent transaction data.Traditional supervised models don't perform very well. The aim of the project is to find semi-supervised/unsupervised methods to detect fraud patterns. In particular, we will be using DBSCAN and graph based methods to detect unusual patterns and alert stakeholders.
Working as an analyst for one of the biggest players in the pharmecutical industry, I had the opportunity to tell stories with data about the KPIs of drugs.I assessed impact of market events such as entry of competitor's drug and loss of exclusivity of Merck's drug on sales and revenue in the anti-biotic drugs space based on therapy area knowledge and treatment protocol. Collaborated with LATAM, APAC and Gulf marketing teams to design and create market opportunity analyser for Bridion(anti-anesthetic drug) to increase focus on creating more product value before its loss of exclusivity
Stress testing is an important aspect for financial institutions especially banks. It measures the health of the institution and the ability to withstand major financial crisis. As part of the team responsible to deliver an integrated stress testing framework to client, I automated the operational risk module. I also identified opportunity areas for cross selling products by evaluating cross sell index of product exposure for both retail and corporate clients.
A consulting project for a solar energy based start-up to understand potential customers based on customer sign-ups. The goal of this project was help the management understand features influencing customer sign-ups - Recommended marketing channels and optimization of mobile website. Also advised a third party vendor for easier access of data for future use.
Considering that in 2014 social users posted an average of 1.8 billion photos to the internet daily and the growth of image-centric platforms like Instagram, Snapchat and Pinterest makes analyzing visual data also very important to understand your audience.We tried to answer What should National Geographic do to increase engagement on its Instagram page?. Image analytics using LDA topic modeling on scraped data from the Instagram Natgeo handle, gave some insights into types of posts that are least and most engaging.
Twitter, with increasing followers, can be a social platform for CEOs to connect directly with customers. Analysed tweeting styles of CEOs from different sectors to ascertain the characteristics that make them influential. Stock price analysis (regression) showed influential CEOs can actually impact stock prices through tweets. Influential CEOs can leverage Twitter to turn negative sentiments of the crowd to positive ones and affect the market.
Today in the world of personalized marketing, it is important to understand each customer and target them accordingly. The task of analyzing customer behaviour from humongous data is a challenging one and this project attempts to recommend books to users based on their given ratings of other books. Collaborative filtering methods on books clustered using K-means improved accuracy and a reccomendation of mixture of books from different clusters can help solve the long tail problem and increase customer satisfaction and experience.
Marketing is all about want you want your customers to associte your brand with.User generated content can give insights on consumers' perceptions. Car reviews data scraped from Edmunds forum helped understand most discussed car brands and most discussed attributes of cars using NLP. Evaluated associations between brands using lift analysis and visualised it using Multidimesional (MDS) plot.
Fatality Analysis Reporting System (FARS) was created in the United States by the National Highway Traffic Safety Administration (NHTSA) to provide an overall measure of highway safety. It contains data on a census of fatal traffic crashes within the 50 States, the District of Columbia, and Puerto Rico. To be included in FARS, a crash must result in the death of a person (occupant of a vehicle or a non-occupant) within 30 days of the crash. Surprisingly, 47% of the fatal crashes happened during the day and just 27% of the accidents where due to drink drivers.
To understand how safe is the city of Austin, we set out to analyse Austin crime data and dig deeper into why crimes in Austin increased drastically in the year 2018. The data showed that non-violent crimes have been increasing but violent crimes were decreasing. The average processing time for violent crimes is 44 days where as for non-violetn crimes is 30 days.
Recent global phenomena related to climate change, call for a new strategy to collectively move towards sustainable energy and reduce our carbon footprint. In one such effort, building owners are incentivised to improve building efficiencies to reduce operating costs and emissions. In this project, building energy consumption is predicted which can then be compared to the actual running costs and decide upon the incentives. Operating costs from over 1000 building over a 3 year timeframe is used to build the prediction model.