intel-analytics / ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, GraphRAG, DeepSpeed, Axolotl, etc
Apache License 2.0
6.71k stars 1.26k forks source link

ppml with bigdl 2.0 stable version and features #3335

Open glorysdj opened 3 years ago

glorysdj commented 3 years ago
jason-dai commented 3 years ago

Please break down the list into several milestones, and each milestone corresponds to a version merged into github: 1) Time schedule for merging the milestone 2) What features each version supports

glorysdj commented 3 years ago

PySpark/Spark on Graphene-SGX Milestones

details

  1. Basic PySpark and SQL support features:

    • [x] backward compatibility with Graphene pre-1.2-rc1, bigdl 2.0, spark 3.1.2
    • [x] add simple pyworker feature
    • [x] pyspark sql example tests
    • [x] pyspark sql api tests tests:
    • [x] python helloworld/simple numpy
    • [x] pyspark pi
    • [x] pyspark wordcount
    • [x] pyspark sql basic
    • [x] pyspark sql arrow
    • [x] pyspark sql hive
    • [x] pyspark sql dataframe api
    • [x] pyspark sql sqlcontext api
    • [x] pyspark sql GroupedData api
    • [x] ... sql apis
    • [x] pyspark sql end to end example [1 millon]
    • [x] pyspark sql end to end example [50 millon]
    • [x] pyspark sql end to end example [100 millon]
  2. Basic Spark SQL support features:

    • [x] backward compatibility with Graphene pre-1.2-rc1, bigdl 2.0, spark 3.1.2
    • [x] spark sql example tests tests
    • [x] examples listed here(https://github.com/apache/spark/tree/master/examples/src/main/scala/org/apache/spark/examples/sql)
    • [x] org.apache.spark.examples.SparkPi
    • [x] org.apache.spark.examples.sql.SparkSQLExample
    • [x] org.apache.spark.examples.sql.RDDRelation
    • [x] org.apache.spark.examples.sql.SimpleTypedAggregator
    • [x] org.apache.spark.examples.sql.UserDefinedScalar
    • [x] org.apache.spark.examples.sql.UserDefinedScalar
    • [x] org.apache.spark.examples.sql.UserDefinedTypedAggregation
    • [x] org.apache.spark.examples.sql.UserDefinedTypedAggregation
    • [x] org.apache.spark.examples.sql.UserDefinedUntypedAggregation
    • [x] org.apache.spark.examples.sql.hive.SparkHiveExample
    • X org.apache.spark.examples.sql.SQLDataSourceExample.scala
    • [x] TPCH 1G
    • [x] TPCH 10G
  3. K8S support features:

    • [x] backward compatibility with Graphene pre-1.2-rc1, bigdl 2.0, spark 3.1.2 tests:
    • [x] spark pi
    • [x] spark pi single-executor with spark.kubernetes.sgx.enabled=false
    • [x] spark pi multi-executor with spark.kubernetes.sgx.enabled=false
    • [x] spark pi single-executor with spark.kubernetes.sgx.enabled=true
    • [x] spark pi multi-executor with spark.kubernetes.sgx.enabled=true
    • [x] spark pi single-executor with spark.kubernetes.sgx.enabled=true and driver in SGX
    • [x] spark pi multi-executor with spark.kubernetes.sgx.enabled=true and driver in SGX
    • [x] pyspark sql basic
    • [x] multi-executor with spark.kubernetes.sgx.enabled=true
    • [x] multi-executor with spark.kubernetes.sgx.enabled=true and driver in SGX
    • [x] pyspark sql e2e
    • [x] multi-executor with spark.kubernetes.sgx.enabled=true
    • [x] multi-executor with spark.kubernetes.sgx.enabled=true and driver in SGX
    • [x] tpch
  4. BigDL/Orca support features:

    • [ ] backward compatibility with Graphene pre-1.2-rc1, bigdl 2.0 tests:
    • [ ] bigdl lenet
    • [ ] orca data
    • [ ] orca tf basic_text_classification.py
    • [ ] orca tf transfer_learning.py
    • [ ] BigDL API
    • [ ] Orca API
glorysdj commented 3 years ago

real time ml on Graphene-SGX Milestones

qiyuangong commented 3 years ago

Occlum Spark K8s Image with fork patch

glorysdj commented 3 years ago

define pyspark sql testsets

  1. pyspark examples (including pi, wordcount)
  2. pyspark sql examples (basic, arrow, hive)
  3. pyspark sql api tests (apis)
  4. pysaprk e2e examples (customer_profile 100M)
glorysdj commented 2 years ago

define spark sql testsets

  1. spark-examples (spark pi)
  2. spark-sql-examples(sql_example, sql_RDDRelation=1, sql_SimpleTypedAggregator, sql_UserDefinedScalar, sql_UserDefinedTypedAggregation, sql_UserDefinedUntypedAggregation, SparkHiveExample)
  3. spark sql e2e example
  4. TPCH 1G/10G