Cartwright is a data profiler that identifies and categorizes spatial and temporal features. Cartwright uses deep learning, natural language processing, and a variety of heuristics to determine whether a column in a dataset contains spatial or temporal information and, if so, what is specifically contained.
Cartwright was built to automate complex data pipelines for heterogenous climate and geopolitical data that are generally oriented around geospatial and temporal features (think maps and time series). The challenge that Cartwright solves is automatically detecting those features so they can be parsed and normalized. This problem turns out to be quite tricky, but Cartwright makes it simple.
I believe we are just ripe for a new semantic web cycle. This is what comes after AI has had a long enough summer, right?
> Cartwright can easily detect things like country, day, latitude
Can it do so in a normalized way? It'd be great to have it produce an "official type" like this one:
https://www.wikidata.org/wiki/Q34027
Then one can imagine a graph-based feature augmentation with other datasets, and learn a SPARQL query. Linking the lat-long pair in a real-estate dataset to geographical features[1] might give a hedge to predicting the local price map.
[1] from the SPARQL wikidata examples: https://query.wikidata.org/#%23Museums%20in%20Brittany%0ASEL...