123 Commits

Author SHA1 Message Date
Jeremy Carbaugh
ddd81c96b1 update CountEmitter to take an optional 'of' argument to display 'x of y' 2011-09-14 13:49:23 -04:00
Jeremy Carbaugh
42213ff106 add test for median with an even number of values 2011-09-14 13:23:22 -04:00
Jeremy Carbaugh
7e7893109a median will now return a float with both even and odd number of values 2011-09-14 13:22:52 -04:00
James Turk
b3b491d927 Merge branch 'master' of github.com:sunlightlabs/saucebrush 2011-08-04 11:39:32 -04:00
James Turk
6bb753d2b1 LICENSE 2011-08-04 11:39:28 -04:00
James Turk
7de29b7b92 bump version 2011-06-03 16:29:52 -04:00
Jeremy Carbaugh
662b43b0ed move import 2011-06-03 13:56:37 -04:00
Jeremy Carbaugh
f0543842c1 add RemoteFile object for streaming from URL 2011-06-03 12:52:13 -04:00
Jeremy Carbaugh
b47cc70d92 add basic tests for stats filters 2011-06-03 10:56:16 -04:00
Jeremy Carbaugh
2e6723c15e update stats filters 2011-06-03 10:55:57 -04:00
Jeremy Carbaugh
687679c6e4 stats 2010-08-19 15:18:39 -04:00
Jeremy Carbaugh
2123977895 add CountEmitter and test 2010-07-14 14:54:26 -04:00
Michael Stephens
4c025493b1 improve exceptions 2010-07-01 13:47:22 -04:00
Michael Stephens
e8ce9948de test run_recipe 2010-06-30 15:31:30 -04:00
Michael Stephens
a0d3010c0c test for double done() exception 2010-06-30 15:29:05 -04:00
Michael Stephens
636e17d89c check for run() call on finished recipe 2010-06-30 15:07:21 -04:00
Michael Stephens
e201df8eda include exception data in error stream output 2010-06-30 14:27:57 -04:00
Michael Stephens
54a99d4d05 added basic error stream support 2010-06-30 14:13:31 -04:00
Michael Stephens
c5770668e0 allow first argument of MongoDBEmitter to be a pymongo.database.Database instance 2010-06-30 11:33:58 -04:00
James Turk
559eb725e6 ConditionalPathFilter 2010-06-28 21:03:30 -04:00
James Turk
18a15022a5 Subrecord with non-modifying keys 2010-06-28 15:47:25 -04:00
James Turk
07f50fb6a1 Subrecord improvement and more tests
still has a huge issue in that records *must* have side effects
2010-06-28 15:23:33 -04:00
James Turk
cb29e77a05 initial subrecord filter 2010-06-28 10:43:19 -04:00
Michael Stephens
38096b3903 added LoggingEmitter 2010-06-22 14:19:07 -04:00
Michael Stephens
a08771b3b1 call _process_file on the actual file object instead of the path inside JSONSource 2010-06-18 14:10:24 -04:00
Michael Stephens
07d9725ce4 add validator support 2010-06-18 11:07:40 -04:00
Michael Stephens
aeaa7095cf added UniqueID filter 2010-06-17 17:52:43 -04:00
Michael Stephens
ce55ad0fcf added FileSource and JSONSource 2010-06-17 17:06:28 -04:00
Michael Stephens
172af19014 skeleton setup.py 2010-06-17 11:26:13 -04:00
Jeremy Carbaugh
dcb0287a79 handle conversion of None to null 2010-03-09 14:33:54 -05:00
Jeremy Carbaugh
f2d36f0d04 StringFilter called super on UnicodeFilter. fixed that 2010-03-09 14:33:33 -05:00
Jeremy Carbaugh
96e8dd7b45 do not quote non-string values 2010-03-09 14:32:43 -05:00
James Turk
6ffce4cf6e minor cleanups in emitters 2010-02-21 14:21:19 -05:00
James Turk
a6574ecba7 basic emitter tests 2010-02-21 14:20:48 -05:00
James Turk
13d8d1365b basic source tests 2010-02-21 13:44:39 -05:00
James Turk
de3155cb06 filter test improvements 2010-02-21 12:01:04 -05:00
James Turk
30cb36e17e cleanup of basic filters from writing tests 2010-02-21 00:35:56 -05:00
James Turk
079857c961 added first batch of tests for many of the standard filters 2010-02-20 23:55:27 -05:00
James Turk
b607441494 filters are no longer callables
filters were originally callables, but with the addition of recipe
as a required parameter, code no longer looks like

result = filter(filter2(filter3(data)))
--but instead--
result = filter(filter2(filter3(data, recipe), recipe), recipe)

this looks cleaner when written like:
data2 = filter3.attach(data, recipe)
data3 = filter2.attach(data2, recipe)
result = filter.attach(data3, recipe)
2010-02-20 14:15:07 -05:00
James Turk
c8991869ba gitignore 2010-02-20 11:26:46 -05:00
James Turk
1e4f536636 get rid of multithreading/multitasking.. 2010-02-20 11:25:55 -05:00
James Turk
5421c12acd some cleanups discovered via test-splitter.py failure
* recipe is now a kwarg to Filter.__call__
* recipe.reject_record
* Splitter works with iterables
2010-02-20 11:14:23 -05:00
James Turk
293c7a0a78 remove dotted keys, a terrible idea 2010-02-19 23:10:50 -05:00
Jeremy Carbaugh
2cde3badad add keep_records parameter on FieldMerger to prevent it from removing merged fields 2009-10-09 17:28:41 -04:00
Jeremy Carbaugh
dcda3db140 add replace option to sqliteemitter and add better kwargs to dictreader on csvsource 2009-09-01 17:45:05 -04:00
Jeremy Carbaugh
1d05d434b9 update FieldAdder filter to update field if it already exists 2009-08-14 16:48:25 -04:00
Jeremy Carbaugh
dc1d3eab14 separate Unicode and String filters and fix bug in utils.Files 2009-08-06 14:29:22 -04:00
Jeremy Carbaugh
20a7beebab fix unicode encoding of unicode strings, add delimiter option in CSVSource 2009-07-29 16:51:16 -04:00
Jeremy Carbaugh
2e1480d778 update sqlitesource 2009-07-28 16:44:21 -04:00
Jeremy Carbaugh
71fee9897b add Files to handle opening and iteration over multiple files 2009-07-23 17:44:18 -04:00
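
The message on commit b607441494 describes replacing callable filters with an attach() method that takes the data and the recipe. A minimal sketch of that chaining style follows. Only the attach(data, recipe) shape comes from the commit message; the Filter base class, the process_record hook, the Strip and DropEmpty filters, and the run_chain helper are all hypothetical names chosen for illustration and are not taken from the saucebrush source.

```python
class Filter:
    """Hypothetical base class; not the library's actual implementation."""

    def process_record(self, record):
        # Subclasses override this; returning None drops the record.
        return record

    def attach(self, source, recipe=None):
        # Lazily pull records from the upstream iterable and yield the
        # ones this filter keeps. The recipe is accepted but unused here.
        for record in source:
            result = self.process_record(record)
            if result is not None:
                yield result


class Strip(Filter):
    """Illustrative filter: trims whitespace from string values."""

    def process_record(self, record):
        return {k: v.strip() if isinstance(v, str) else v
                for k, v in record.items()}


class DropEmpty(Filter):
    """Illustrative filter: drops records whose 'name' field is empty."""

    def process_record(self, record):
        return record if record.get("name") else None


def run_chain(source, filters, recipe=None):
    # Attach each filter to the output of the previous one, in order.
    data = source
    for f in filters:
        data = f.attach(data, recipe=recipe)
    return list(data)
```

Under these assumptions, run_chain(records, [Strip(), DropEmpty()]) reads as a flat sequence of attach calls instead of nested calls that each repeat the recipe argument.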