If you use the code, may we ask you to please cite the corresponding paper.Fractals
Dimensionality reduction
-
FastMap tar file
In C. Paper:
- Christos Faloutsos and King-Ip Lin,
FastMap: a Fast Algorithm for Indexing, Data-Mining
and Visualization of Traditional and Multimedia Datasets
SIGMOD 1995, pp. 163-174
- Karhunen-Loeve
code, for dimensionality reduction (in mathematica)
- Wrapper
script
for ICA, in R. ( tar xvf; make )
Indexing
-
OMNI tree,
in C++. Implementation by Roberto Figueira Santos Filho. Paper:
- kd-tree in C. Tested on linux and mac-osx.
Do tar xvf; make
Time series
Graph mining on hadoop
-
PEGASUS: graph mining system on hadoop. Large project - the site
has software, manual, papers, instructions video
Other graph mining tools
Miscellaneous
- PCHelp (tar-ed
and gzip-ed) For PC Chairs: helps assign papers to reviewers, given
their preferences
- catbib.bash:
Script to concatenate bibtex files, eliminating duplicates.
Wrapper for
'bibtool'
(available on linux, mac-ports, etc)
-
Anonymizer script - it assigns unique ids to each entry,
for the desired column. In perl ( tar xvf; make)
(Updated Oct. 14, 2017 - has the '-z' flag to start numbering
from zero; updated Sept. 24, 2024, to handle csv files as input)
-
Heat-map plots - ready for log-log scales.
In python, bash and gnuplot; using
code from Danai Koutra.
( tar xvf; make)
- Demo for
Haar wavelets
(create a new python virtual environment; then tar xvf; make).
Last modified: Nov. 22, 2024, by Christos Faloutsos