Author Archives: Thomas Murakami-Brundage

Big Data Analysis Tools/Resources: An Annotated Bibliography

Big Data Analysis Tools/Resources 1.     Chris Stucchio’s blog http://www.chrisstucchio.com/blog/2013/hadoop_hatred.html This blog makes this list primarily for one article: “Don’t use Hadoop – your data isn’t that big”, which provides a guide for deciding whether your data really qualifies as big … Continue reading

Posted in Careers And Work, Data Science, Education, Information Technology, Resource-a-rama | Tagged , , | 2 Comments

SQL Programming Resources: An Annotated Bibliography

SQL Programming Resources 1.     Coursera/Stanford “Introduction to Databases” https://www.coursera.org/course/db One of the biggest names in Massive Open Online Courseware (MOOC), Coursera has a huge database of university level courses. This offering from Stanford University has 9 units devoted to SQL … Continue reading

Posted in Careers And Work, Digital Libraries, Education, Information Technology, Resource-a-rama | Tagged , | 1 Comment

‘Big Data’ Public Databases: An Annotated Bibliography

Big Public Databases 1.     Kin Lane’s Federal Dataset Tool http://federal-agency-dataset-adoption.publicprivatesector.org/index.html Many of the following listings refer to US Federal Government datasets. These are some of the biggest public datasets available. Unfortunately, much of this data is messy, published without much … Continue reading

Posted in Data Science, Digital Libraries, Information Technology, Resource-a-rama | Tagged , | 3 Comments

Database Training Resources: An Annotated Bibliography

Database Training Resources 1.     Coursera/Stanford “Introduction to Databases” https://www.coursera.org/course/db Introduction to Databases from Stanford University was one of the first Massive Open Online Courses (MOOCs) offered by Coursera in 2011, and has remained consistently popular. The course covers database design and … Continue reading

Posted in Careers And Work, Education, Information Technology, Resource-a-rama | Tagged , | Leave a comment

Data Visualization Resources: An Annotated Bibliography

Data Visualization Resources 1.     The St Louis Federal Reserve http://research.stlouisfed.org/ The St Louis Federal Reserve Economic Data series is perhaps the most comprehensive repository of time-series data. It also offers an in-browser, cross-platform, data visualization tool. The time series are … Continue reading

Posted in Careers And Work, Data Visualization, Information Technology, Resource-a-rama | Tagged | Leave a comment

Game Design and Choices of Creation

Concerning my self-imposed goals to write and produce games: Human decision-making is largely fueled by seeking out novel/familiar stimuli, as well as avoiding previous pain points and repeating pleasurable experiences. We will need to keep this paradigm in mind: People … Continue reading

Posted in Gamecraft, Resource-a-rama, Wordplay and Commentary | Leave a comment

Jaro-Winkler in ORACLE and textual fuzzy matching

There is a little-known (and hence heavily under-utilized) function in Oracle 11g and up. This is the Jaro-Winkler algorithm (and the companion algorithm named Edit Distance). The Jaro-Winkler algorithm tells you what level of permutation would be necessary in order … Continue reading

Posted in Data Science, Information Technology, The Cloud, Wordplay and Commentary | Leave a comment