Deep Data Mining Blog: Relational database and data mining algorithms

Friday, September 14, 2012

Relational database and data mining algorithms

Some algorithms may appear prohibitively expensive to perform computationally. One of such examples is finding for each data point in table A the nearest K neighboring data points in table B (shown in the figure below). From the surface, it involves calculating the distances between 2 trillion pairs of data points (1 million times 2 million).

However, with database tricks such as binary-tree index, we can make the above algorithm very efficient. Thus, it is helpful for algorithm guys like mathematicians to understand some database technologies. With those database tricks, we can implement algorithms that may be hard to do otherwise.

10 Most Influential People	Text Files and Oracle DB	Predictive Model vs Rule	Build Predictive Model	About Predictive Model Variable	Logistic Regression
Recency Frequency Monetary Analysis	Unique Identifier in Oracle	Materialized View	Database Link	Calculate Percentage Using SQL	Handle NULL Value
Calculate Cumulative Perentage	Find Score Cutoff Value	Remove Duplicates	Calculate Correlation Coefficients	Oracle vs SQL Server	Random Sampling
Table Insert	Read Only Table	Clustering	Ranking	Find Most Frequent	Median Value
Oracle Source Code	Debug PL/SQL	Hide PL/SQL Scripts	Repair Views	Dump Schema	Move Big Files to Amazon

Popular Topics

Popular Topics

Friday, September 14, 2012

Relational database and data mining algorithms

No comments: