IntelPython / bearysta

Pandas-based statistics aggregation tool
Apache License 2.0

Rethink aggregator recipes and the aggregator in general #14

Closed bibikar closed 4 weeks ago

bibikar commented 4 years ago

Here's some deeper discussion on Python API for the aggregator.

Relates to #94 and #104. Also relates to #90, as releasing only the Python API might make it easier for us to support.

Current aggregator recipes are not very flexible. They basically force a certain workflow, which requires many layers of indirection and boilerplate in configs. This generally makes me think too much about how to fix particular configs, e.g. by adding another layer of indirection to pivot rows onto columns or something like that (for examples, see PRs #96, #98, #99). While it's not too hard to do that for now, it's already very messy: note how many separate configs we have for random forests, logistic regression, and SVM!

A few problems with the current aggregator config structure include:

Anton suggested that we could:

For example, we could have these Python configs look like

from pyrforator import aggregate as agg
import pandas as pd

def recipe(path='data/sklearn*.csv', **options):  # other options elided in this sketch
  # read data...
  # preprocess can use the same format of regex -> (repl|drop|None),
  # or it could just be a function which returns the filtered line, or None to drop it!
  df = agg.read_csv(path, preprocess={'@ Package': 'drop'}, **options)  # plus pandas options

  # compare to native C
  df['Ratio'] = agg.ratio_of(df, columns=['Prefix'], values=['Time'], against=('Native-C',))

  return df

agg.run(recipe)

and then they get access to both our functions and pandas functions, and anything else they need! We just have to package automated_benchmarks into a conda package.

I also want to make the Python API so simple to use that it basically supersedes the YAML configs. A user should really just be able to run conda install pyrforator and then write their config.
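To make the sketch above concrete, here is one way the hypothetical `agg.ratio_of` could be implemented in pandas. The function name and signature come from the sketch, not from released code, and the direction of the ratio (reference time over measured time, i.e. speedup) is my assumption:

```python
import pandas as pd

def ratio_of(df, columns, values, against):
    """Hypothetical sketch: for each row, divide its `values` by those of the
    matching reference row, where the reference is the row whose `columns`
    equal `against` and whose remaining key columns match."""
    # Rows belonging to the reference combination, e.g. Prefix == 'Native-C'
    mask = (df[list(columns)] == pd.Series(dict(zip(columns, against)))).all(axis=1)
    # Key columns: everything that isn't a compared column or a value column
    keys = [c for c in df.columns if c not in list(columns) + list(values)]
    ref = df.loc[mask, keys + list(values)].rename(
        columns={v: v + '_ref' for v in values})
    merged = df.merge(ref, on=keys, how='left')
    # Reference value divided by this row's value, i.e. speedup over reference
    return merged[values[0] + '_ref'] / merged[values[0]]

# Usage, mirroring the recipe sketch above:
df = pd.DataFrame({'Prefix': ['Native-C', 'IDP'],
                   'Function': ['dot', 'dot'],
                   'Time': [1.0, 2.0]})
df['Ratio'] = ratio_of(df, columns=['Prefix'], values=['Time'],
                       against=('Native-C',)).to_numpy()
```

A plain merge like this keeps the step a pure DataFrame-in, Series-out function, which fits the "users get pandas plus our helpers" idea.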

bibikar commented 4 years ago

copied from an old PR...

A bunch of this stuff is still TODO; I'm just putting the plans here. Once we merge this PR, I think it would be fine to publish the aggregator. The aggregator is currently in a really messy state and there's no way we can publish it as-is.

This PR is a complete rewrite of the aggregator as an actually generic tool to manipulate tables of numbers. Instead of having one big monolithic function which does things in a fixed pipeline, we allow the user to specify the pipeline of execution. The user also gets a temporary namespace to put dataframes in.

We retain the same functionality we had before, but configs will all need to be rewritten. We also unify both the input and output sections into the pipeline concept as separate pipeline steps, and do away with the global axis/series/variants definition entirely for flexibility.

We also get rid of the meaningless Benchmark class, which was created originally to deal with very benchmark-specific things. The structure of that class was a big mess, and while we can still use OOP here, I'm leaning towards making the model as simple as possible.

The entire system is no longer completely config-dependent; we could pass in a deserialized configuration as well, for example. Nesting pipelines is also planned (but I'm still thinking about exactly how to implement it).

The aggregator now operates in two big steps:

  1. Read configs and construct pipelines. We transform configurations into Pipeline objects which contain functions bound similarly to those created with functools.partial. We also perform some sanity-checking on configurations here so things don't fail after reading all the dataframes.
  2. Execute pipelines. We now perform the computation on the actual data. Because input sections are just pipeline steps, reading the data actually only happens now, and the entire pipeline is executed. Output sections are also the same way.
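The two-phase model above could be sketched roughly as follows. `Pipeline` and the `steps` registry are illustrative names, not the actual bearysta implementation, and I assume here that each config section holds exactly one step (ignoring extras like `src`/`dest`):

```python
import functools
import pandas as pd

class Pipeline:
    """Phase 1: bind each config section to a step function (like
    functools.partial), sanity-checking step names before any data is read.
    Phase 2: run() executes the bound steps over a dataframe."""

    def __init__(self, config, steps):
        self.bound = []
        for section in config:
            (name, params), = section.items()  # each section is {step: params}
            if name not in steps:
                raise ValueError(f'unknown pipeline step: {name}')
            self.bound.append(functools.partial(steps[name], **(params or {})))

    def run(self, df=None):
        for step in self.bound:
            df = step(df)  # each step takes a dataframe and returns one
        return df

# A toy step registry; real steps would be the annotated functions
# described below.
steps = {
    'rename': lambda df, **mapping: df.rename(columns=mapping),
    'filter_out': lambda df, **cols:
        df[~(df[list(cols)] == pd.Series(cols)).any(axis=1)],
}

pipe = Pipeline([{'rename': {'Size': 'Foo'}},
                 {'filter_out': {'Implementation': 'linpack'}}], steps)
out = pipe.run(pd.DataFrame({'Size': [1, 2],
                             'Implementation': ['mkl', 'linpack']}))
```

Because binding happens before execution, a typo in a step name fails fast in phase 1 instead of after all the CSVs have been read.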

Valid pipeline steps are simply annotated functions. Python's inspect module is used to determine (using the annotations) where dataframes should be passed to the functions, and where parameters from the config should be bound. That means minimal effort for writing new valid pipeline steps, and much easier maintenance of the actual implementations which use pandas (the pipeline only passes dataframes around and doesn't otherwise care about them). The main caveat is that config keys should now generally use underscores to separate words rather than dashes, since Python parameter names can't contain dashes.
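The annotation-driven dispatch could look something like this. `call_step` and the `filter_in` signature are illustrative guesses, not the actual internals:

```python
import inspect
import pandas as pd

def call_step(func, df, params):
    """Bind `df` to every parameter annotated pd.DataFrame and fill the
    rest from the config dict, so step authors just write plain functions."""
    kwargs = {}
    for name, p in inspect.signature(func).parameters.items():
        if p.annotation is pd.DataFrame:
            kwargs[name] = df
        elif name in params:
            kwargs[name] = params[name]
        elif p.default is inspect.Parameter.empty:
            raise TypeError(f'config is missing required key: {name}')
    return func(**kwargs)

# A pipeline step is then just an annotated function.  Note the underscore
# in keep_values: config keys map to Python parameter names, so underscores
# are preferred over dashes.
def filter_in(df: pd.DataFrame, column: str, keep_values: list) -> pd.DataFrame:
    return df[df[column].isin(keep_values)]

df = pd.DataFrame({'Foo': [50_000_000, 7], 'Time': [1.0, 2.0]})
out = call_step(filter_in, df, {'column': 'Foo', 'keep_values': [50_000_000]})
```

The step implementation never sees the pipeline machinery at all, which is what keeps new steps cheap to write.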

TODO that really should be done before merging, so we don't lose functionality or have a broken project

optional TODO that could be in other PRs

Example

Currently this is a toy example; I haven't written much of the functionality for this rewrite yet, but once that's done, the existing configurations can serve as examples. I'll try to keep this updated as the PR evolves.

I'm also still not entirely sure how the separate buffers should be handled: should they just be space for one reference, or stacks of multiple DataFrame references? Currently, the DataFrame-manipulating functions are completely buffer-unaware, which is probably good. Do we even want these buffer spaces? They're definitely convenient, e.g. for pulling in reference data from some other table, or for pulling in separate tables and merging them in this config. We need to figure out how to handle this properly.

# the entire file is just one big top-level pipeline executed from top to bottom
- input:
    file: '*.csv'
    format: csv # we could possibly infer this as well in the future
    filter: {} # the same filtering syntax that we had before
# since we're particularly whimsical today, let's rename Size column to some meaningless name
- rename:
    Size: Foo
# we want only reasonable problem sizes
- filter_in: 
    Foo: 50000000
# we want only data we care about
- filter_out:
    Implementation: linpack
# we want to compare against MKL
- set_column:
    Speedup over MKL:
      ratio_of:
        values: Time
        columns: [Prefix, Implementation] # I write the list in this format because it's easier to read here
        reference: [Native-C, MKL]
# create a set of pivot tables. this is a crucial step, otherwise we'd just output one big messy table
- pivot_table:
    values: Speedup over MKL # the column we created in set_column above
    columns: [Prefix, Implementation]
    index: [Function, Accuracy]
    aggfunc: mean
    variants: [Arch] # this might cause us to create multiple pivot tables
  dest: pivot # save this to a different space than our normal data
# send this stuff to the specified output format
- output: {} # empty dict here for no options... could just leave it empty for null as well
  src: pivot # use the "pivot" buffer as the input for this pipeline step
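The `variants:` key in the pivot_table step above might expand into multiple pivot tables roughly like this. This is a sketch of the semantics as I read them, not the actual bearysta code, and `pivot_tables` is a hypothetical name:

```python
import pandas as pd

def pivot_tables(df, values, columns, index, aggfunc='mean', variants=None):
    """Yield (variant, table) pairs: one pivot table per unique combination
    of the `variants` columns, or a single table if variants is empty."""
    groups = df.groupby(variants) if variants else [(None, df)]
    for variant, sub in groups:
        yield variant, sub.pivot_table(values=values, columns=columns,
                                       index=index, aggfunc=aggfunc)

# E.g. with variants=['Arch'], one table per architecture:
df = pd.DataFrame({'Arch': ['x86', 'x86', 'arm'],
                   'Function': ['dot', 'dot', 'dot'],
                   'Prefix': ['A', 'B', 'A'],
                   'Time': [2.0, 4.0, 3.0]})
tables = [t for _, t in pivot_tables(df, values='Time', columns=['Prefix'],
                                     index=['Function'], variants=['Arch'])]
```

Splitting on variants before pivoting keeps each output table small and readable instead of producing one big messy table, which is the point of this step.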