Build an Uberjar

leafgrabber$ lein uberjar

Run it (starts a Clojure REPL inside Hadoop)

leafgrabber$ hadoop jar leafgrabber-1.0.0-SNAPSHOT-standalone.jar clojure.main

user=> (use 'cascalog.api)
nil

Run all Attributes

user=> (require '[leafgrabber.free-text.attribute :as att])
nil
user=> (def q1 (att/agg-att-source "data/5000urls"))
#'user/q1
user=> (?<- (hfs-textline "/tmp/hadoop-davidg/test-out-dir") [?uuid ?att ?val] (q1 ?uuid ?att ?val))
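
Print results to the console (sketch)

The query above writes to a user-specific HDFS path, so adjust it for your own account. For a quick check, the same query can probably be pointed at the stdout tap from cascalog.api instead. This is a sketch that assumes q1 is still defined as above and emits the same three fields:

user=> (?<- (stdout) [?uuid ?att ?val] (q1 ?uuid ?att ?val))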

