Mark Needham

Python
Mark NeedhamMarch 19th, 2018
0 122
Yelp: Reverse geocoding businesses to extract detailed location information
I’ve been playing around with the Yelp Open Dataset and wanted to extract more detailed location information for each business.…
Read More »
JavaScript
Mark NeedhamJanuary 2nd, 2018
0 130
Leaflet: Fit polyline in view
I’ve been playing with the Leaflet.js library over the Christmas holidays to visualise running routes drawn onto the map using…
Read More »
Python
Mark NeedhamDecember 14th, 2017
0 107
scikit-learn: Using GridSearch to tune the hyper-parameters of VotingClassifier
In my last blog post I showed how to create a multi class classification ensemble using scikit-learn’s VotingClassifier and finished…
Read More »
Python
Mark NeedhamDecember 11th, 2017
0 95
scikit-learn: Building a multi class classification ensemble
For the Kaggle Spooky Author Identification I wanted to combine multiple classifiers together into an ensemble and found the VotingClassifier…
Read More »
Python
Mark NeedhamDecember 6th, 2017
0 53
Python: Learning about defaultdict’s handling of missing keys
While reading the scikit-learn code I came across a bit of code that I didn’t understand for a while but…
Read More »
Python
Mark NeedhamDecember 5th, 2017
0 114
Python: Combinations of values on and off
In my continued exploration of Kaggle’s Spooky Authors competition, I wanted to run a GridSearch turning on and off different…
Read More »
Python
Mark NeedhamDecember 1st, 2017
0 106
scikit-learn: Creating a matrix of named entity counts
I’ve been trying to improve my score on Kaggle’s Spooky Author Identification competition, and my latest idea was building a…
Read More »
Python
Mark NeedhamNovember 29th, 2017
0 139
Python: polyglot – ModuleNotFoundError: No module named ‘icu’
I wanted to use the polyglot NLP library that my colleague Will Lyon mentioned in his analysis of Russian Twitter…
Read More »
Python
Mark NeedhamNovember 20th, 2017
0 151
Python 3: TypeError: unsupported format string passed to numpy.ndarray.__format__
This post explains how to work around a change in how Python string formatting works for numpy arrays between Python…
Read More »

Mark Needham

Yelp: Reverse geocoding businesses to extract detailed location information

Leaflet: Fit polyline in view

scikit-learn: Using GridSearch to tune the hyper-parameters of VotingClassifier

scikit-learn: Building a multi class classification ensemble

Python: Learning about defaultdict’s handling of missing keys

Python: Combinations of values on and off

scikit-learn: Creating a matrix of named entity counts

Python: polyglot – ModuleNotFoundError: No module named ‘icu’

Python 3: TypeError: unsupported format string passed to numpy.ndarray.format

Thank you!