MaxFrame is a computational framework created by Alibaba Cloud that lets Python developers parallelize their code with MaxCompute. It builds a runnable computation graph locally, submits it to MaxCompute for execution, and retrieves the results from MaxCompute.
The MaxFrame client is the client side of MaxFrame. It currently provides a DataFrame-based SDK with pandas-compatible APIs. Compatible APIs for other common Python libraries such as numpy and scikit-learn will be added in the future. Python 3.7 is recommended for the MaxFrame client to enable all functionality; support for later Python versions is on the way.
You can install the MaxFrame client with pip:

.. code:: bash

    pip install maxframe

The latest beta version can be installed with the ``--pre`` argument:

.. code:: bash

    pip install --pre maxframe

You can also install the MaxFrame client from source:

.. code:: bash

    pip install git+https://github.com/aliyun/alibabacloud-odps-maxframe-client.git

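After any of the installation routes above, it can be useful to confirm which version ended up in the environment. The following is a minimal sketch rather than part of MaxFrame itself: ``installed_version`` is a hypothetical helper name, and the fallback import assumes the ``importlib_metadata`` backport is available on Python 3.7, where ``importlib.metadata`` is not in the standard library.

```python
from typing import Optional

try:
    # Standard library on Python 3.8 and later.
    from importlib.metadata import PackageNotFoundError, version
except ImportError:
    # Assumption: on Python 3.7 the importlib_metadata backport is installed.
    from importlib_metadata import PackageNotFoundError, version


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of ``package``, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# Prints a version string if the ``maxframe`` distribution is installed,
# otherwise prints None.
print(installed_version("maxframe"))
```

A ``None`` result suggests that pip installed the package into a different environment than the one running the interpreter.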
The following simple example of the MaxFrame client reads data from a MaxCompute table, applies a simple transformation, and writes the result back to MaxCompute.

.. code:: python

    import maxframe.dataframe as md
    import os
    from maxframe import new_session
    from odps import ODPS

    # Create a MaxCompute entry object with credentials from the environment.
    o = ODPS(
        os.getenv('ALIBABA_CLOUD_ACCESS_KEY_ID'),
        os.getenv('ALIBABA_CLOUD_ACCESS_KEY_SECRET'),
        project='your-default-project',
        endpoint='your-end-point',
    )
    # Start a MaxFrame session bound to that MaxCompute project.
    session = new_session(o)

    # Read the source table, prepend a prefix to column "A", and write it back.
    df = md.read_odps_table("sourcetable")
    df["A"] = "prefix" + df["A"]
    md.to_odps_table(df, "prefix_source_table")

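Because the DataFrame API is described as pandas-compatible, the column transformation in the example above can be previewed locally with plain pandas before running it against MaxCompute. This is a minimal sketch under stated assumptions: the sample rows are made up, and ``add_prefix`` is a hypothetical helper that is not part of MaxFrame.

```python
import pandas as pd


def add_prefix(frame: pd.DataFrame, column: str, prefix: str) -> pd.DataFrame:
    """Return a copy of ``frame`` with ``prefix`` prepended to a string column."""
    result = frame.copy()
    # String + Series concatenates element-wise, as in the MaxFrame example.
    result[column] = prefix + result[column]
    return result


# Made-up rows standing in for the contents of the source table.
sample = pd.DataFrame({"A": ["x", "y"], "B": [1, 2]})
preview = add_prefix(sample, "A", "prefix")
print(preview)
```

The helper works on a copy so that the local sample stays unchanged, which makes it easy to compare the input and output side by side.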
Detailed documentation can be found `here <https://maxframe.readthedocs.io>`__.
Licensed under the `Apache License 2.0 <https://www.apache.org/licenses/LICENSE-2.0.html>`__.