Andrea Ciufo - DataScientist and Proud Civil Engineer

Is Success Related to Hard Work or Luck? – TOEFL Writing Independent Task Example

“When people succeed, it is because of hard work. Luck has nothing to do with success.” Do you agree or disagree with the quotation above? Use specific reasons and examples to explain your position. I strongly believe the more you try, the luckier you are. Success is a combination of hard work, smart work and …

Analytics Introduction, 3 Key Points

The King of a faraway kingdom decided to change the laws by introducing divorce. Five years after the law was introduced, he gathered the Black Knight and the White Knight and told them: "You, Black Knight, have to go from house to house and count the number of divorces that happened during these years. Instead …

XGBoost in Python (Quickly) Explained (3min Read)

What is it? XGBoost is an algorithm used for supervised learning problems. The name stands for Extreme Gradient Boosting. How does it work? Imagine a sequence of models in which each model is trained on the errors of its predecessor. Where is it applied? Classification and Regression Trees (CART) are the base learner and you apply a gradient …
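The idea of "each model is trained on the errors of its predecessor" can be illustrated with a toy sketch. This is not XGBoost itself and does not use the xgboost library: it is plain Python that fits one-split regression trees (stumps) to residuals under squared error. The function names (`fit_stump`, `boost`, `mse`), the learning rate, and the sample data are all assumptions made up for illustration.

```python
# Toy illustration of boosting: each new model is fit to the errors
# (residuals) left by the models before it. This is NOT XGBoost itself.

def fit_stump(xs, residuals):
    """Fit a one-split regression tree (stump) that minimizes squared error."""
    best = None
    # Try every split point that leaves at least one sample on each side.
    for threshold in sorted(set(xs))[:-1]:
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = (sum((r - left_mean) ** 2 for r in left)
               + sum((r - right_mean) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    _, threshold, left_mean, right_mean = best
    return lambda x: left_mean if x <= threshold else right_mean


def boost(xs, ys, n_rounds=50, learning_rate=0.3):
    """Build a sequence of stumps, each trained on the current residuals."""
    base = sum(ys) / len(ys)          # the first "model" is just the mean
    stumps = []
    preds = [base] * len(xs)
    for _ in range(n_rounds):
        residuals = [y - p for y, p in zip(ys, preds)]  # predecessor's errors
        stump = fit_stump(xs, residuals)
        stumps.append(stump)
        preds = [p + learning_rate * stump(x) for x, p in zip(xs, preds)]
    return lambda x: base + learning_rate * sum(s(x) for s in stumps)


def mse(model, xs, ys):
    """Mean squared error of a model on the given points."""
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Made-up sample data, purely for illustration.
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [x * x for x in xs]

mean_y = sum(ys) / len(ys)
baseline = lambda x: mean_y           # predicts the mean everywhere
model = boost(xs, ys)

print("baseline MSE:", mse(baseline, xs, ys))
print("boosted MSE: ", mse(model, xs, ys))
```

The real library adds regularization, second-order gradient information, and many engineering optimizations on top of this basic residual-fitting loop.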

My First Natural Language Processing Job Interview As a Self-Taught Data Scientist

A French startup (https://navee.co/) wrote to me on LinkedIn about a challenging job interview (spoiler: it didn’t go well). They identify potential online fraud through Natural Language Processing and Image Recognition on online marketplaces. E.g., a house picture on a marketplace that is also on a stock photo website with a standard message could be classified …

Nine Principles for Forecasting in Survey Analysis (With Scientific Bibliography)

While I was building a forecast model and looking for scientific confirmation of some assumptions, I discovered an interesting book that I highly recommend: “Principles of Forecasting: A Handbook for Researchers and Practitioners” by J. Scott Armstrong, from the Wharton School. This handbook deals with all kinds of decision models. It starts from the judgmental …

The best-selling drugs in Italy are the ones that can be advertised

On the Ministry of Health website, there is an open data section where you can find information in *.csv format on the top 50 best-selling drugs in Italy. I decided to investigate this dataset, grouping some information (Python Code Attached Below, …

What are the best-selling drugs in Italy?

The Italian Ministry of Health published a dataset on the most distributed * drugs through drugstores (here you can find the dataset). In methodological terms, I have aggregated all the drugs with the same starting word. E.g., all kinds of “Tachipirina” packs (the best-selling paracetamol drug in Italy) are grouped in a single variable, regardless of whether …
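The grouping by starting word described above might look roughly like the sketch below. It is not the author's attached code: the function name `group_by_first_word`, the (name, packs) row layout, and the sample rows with their counts are placeholders invented for illustration, not values from the Ministry dataset.

```python
from collections import defaultdict


def group_by_first_word(rows):
    """Sum pack counts for all products sharing the same first word.

    `rows` is an iterable of (product_name, packs) pairs; this layout is an
    assumption, not the actual structure of the Ministry's CSV file.
    """
    totals = defaultdict(int)
    for name, packs in rows:
        words = name.split()
        if not words:          # skip blank product names
            continue
        brand = words[0].upper()   # e.g. every "Tachipirina ..." pack
        totals[brand] += packs
    return dict(totals)


# Invented placeholder rows, NOT real figures from the dataset.
sample_rows = [
    ("Tachipirina 500 mg tablets", 10),
    ("TACHIPIRINA 1000 mg tablets", 5),
    ("Exampledrug syrup", 3),
]

print(group_by_first_word(sample_rows))
```

Uppercasing the first word is a design choice so that differently cased spellings of the same brand fall into one group.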