This is a summary of all the Data Big Bang blog articles by subject.
IR
A summary of information retrieval stages and current data science articles.
Fetching
Cleaning/Tidying
Parsing
Handling of Active Content
Main Content Extraction
- Extraction of Main Text Content Using the Google Reader NoAPI
- Voice Recognition + Content Extraction + TTS = Innovative Web Browsing
Language Identification
Security
- Automated Browserless OAuth Authentication for Twitter
- The Python POPO’s Way to Integrate PayPal Instant Payment Notification
APIs and NoAPIs
- Google Search NoAPI
- Exporting StackOverflow users blogs to Excel Hyperlinks
- Extraction of Main Text Content Using the Google Reader NoAPI
- Integrating Google Analytics into your Company Loop with a Microsoft Excel Add-on
Analytics
Voice Recognition and Text to Speech
Policies and Data Issues
Entrepreneurship
Marketing and Sales
Plugins
Big Data Stack
- Using Queues in Web Crawling and Analysis Infrastructure
- Persisting Native Python Queues
- Adding Acknowledgement Semantics to a Persistent Queue
- Esoteric Queue Scheduling Disciplines
Tools
Announcements
Resources
Digital Art by Don Relyea