November 18th 2008
The book description was really appetizing: Machine Learning applied to the Internet, so it should be easy to understand, and Python as the mean to compute. Unfortunately, contrary to what I saw in different reviews, I was not pleased with the book, and here is why.
Machine learning is a difficult topic. I can understand that there is need for a introductory book. The need is there (it can be seen on the O’Reilly comments of the book). But need does not imply not thorough:
- The tools of the book can be dangerous. Neural networks are regularly debatted, and the litterature is very dense. You cannot expect to understand neural networks with only a chapter. And this applies to every other tool (multidimensonal scaling, …).
- There are no references, no bibliography to help if you encounter a problem. Say you apply neural networks as explained in the book. You get a result you don’t expect. Where do you turn to to have an explanation? In such a book, you have to give references.
Another point is that the code quality is bad. Really bad. If you do machine learning, you use the appropriate Python tools, i.e. Numpy. Besides there are a lot of additional modules to help using machine learning.
I know it’s difficult to write a book and to think about everything: I’ve gone through that path. But if you want to write an ambitious book and if you fail to meet the requirements, you must expect some bad reviews
Such a book should be bigger, far bigger. If you want a real book about Machine Learning, get Pattern Recognition and Machine Learning from Christopher Bishop. And if you want to apply your findings to the web, well, there are a lot of books on web analysis.Tags: Book review, Machine learning, Python, Scientific computing, Web
2 Comments »