Friday, January 2, 2009

CSV Data Mix & Map on Google App Engine

I just recently started to mess around with a deployment of the CSV Data Mix project of mine hosted on the Google App Engine. This is currently being tested, but is active as I test. You can try it out via the link below. 
If you want to give this hosted application a test to see how it works you can do the following.

Mixing Data
  1. Pick your dataset. You can used some of the sample data I've used for testing, for example,  here is a dataset with 500 rows of fake user info.
  2. Determine the field delimiter of your data. If you are using this suggested file via the link in step one, then the field delimiter is a comma. 
  3. Decide if you want the mix to be aware of column headings. When using the sample data, there are headings so choose "Yes". 
  4. Click "Mix Data"
The result will be a response page of plain text containing the contents of the shuffled data. This page can then be saved as your mixed data set. 

Mapping Data
  1. Pick two datasets to compare,  basically you should be interested in determining what the shuffle did to your data. You can used the following sample data: fake info (file 1) and shuffled fake info (file 2).
  2. Determine the field delimiter of the files, this should be the same in both files. 
  3. Click "Map Data"
The result will be a response page of plain text containing the mapping. This result will attempt to show where a cell in file one exists in file two in the following format: (x,y) -> (x,y). To learn more about this mapping structure you can read the CSV Data Mapping post. 

Disclaimer
This application demo on the Google App Engine has just recently been developed and has caused some changes to the way the previous code releases have worked. Please consider this a beta at this time. 

No comments:

Share on Twitter