This is an initial addition of the FusedOps and Kernels Plugin.

Implemented `ModelPatcher`, our novel solution for introducing fused-ops and kernels without explicitly rewriting modeling functions; this diverges from unsloth, which explicitly rewrites the model files.
`ModelPatcher`'s design is based on rule-based patching, which makes it easy to handle different models. All model forwards that are patched are tracked.
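As a rough illustration of the rule idea, a rule can pair a target module class with a fused replacement forward, and each application is recorded so patched forwards stay tracked. This is a minimal sketch only; `ModelPatcherRule`, `register_rule`, and `patch_model` below are illustrative names, not necessarily the exact API this PR introduces.

```python
# Minimal sketch of rule-based forward patching. The names here
# (ModelPatcherRule, register_rule, patch_model) are illustrative,
# not necessarily the exact API added in this PR.
from dataclasses import dataclass
from typing import Callable, List, Type

import torch.nn as nn


@dataclass
class ModelPatcherRule:
    rule_id: str                 # unique id used to track applied patches
    target_cls: Type[nn.Module]  # module class whose forward gets replaced
    new_forward: Callable        # fused replacement forward


_RULES: List[ModelPatcherRule] = []
_PATCH_HISTORY: List[str] = []   # every patched forward is recorded here


def register_rule(rule: ModelPatcherRule) -> None:
    _RULES.append(rule)


def patch_model(model: nn.Module) -> nn.Module:
    # Walk the model once; any submodule matching a rule has its forward
    # rebound to the fused implementation, and the patch is tracked.
    for module in model.modules():
        for rule in _RULES:
            if isinstance(module, rule.target_cls):
                module.forward = rule.new_forward.__get__(module)
                _PATCH_HISTORY.append(
                    f"{rule.rule_id} -> {module.__class__.__name__}"
                )
    return model
```

Because the rules are declarative, supporting a new architecture is mostly a matter of registering rules for its module classes rather than rewriting the modeling file.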
`ModelPatcher` also performs rule-based patching of modeling code, which is sometimes more efficient than patching in a whole new forward. For example, to replace the `CrossEntropyLoss`, we do not want to rewrite an entire forward just to change the loss; `ModelPatcher` can patch this through an intelligent use of `importlib`.
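As a concrete example of the `importlib` approach, the loss class can be swapped by reassigning the symbol inside the already-imported modeling module. This is a hedged sketch: `FusedCrossEntropyLoss` is a hypothetical stand-in, and it assumes the modeling file resolves `CrossEntropyLoss` from its own module globals at call time (as the HF llama modeling code does).

```python
# Sketch of an importlib-based loss swap; FusedCrossEntropyLoss is a
# hypothetical stand-in for a fused kernel implementation.
import importlib

import torch.nn as nn


class FusedCrossEntropyLoss(nn.CrossEntropyLoss):
    """Stand-in for a fused cross-entropy implementation."""


# modeling_llama instantiates CrossEntropyLoss from its module globals when
# the forward runs, so reassigning the attribute redirects the loss without
# rewriting the forward itself.
mod = importlib.import_module("transformers.models.llama.modeling_llama")
mod.CrossEntropyLoss = FusedCrossEntropyLoss
```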
This PR only contains `ModelPatcher` rules for `llama` and `mistral`.
This PR only supports `auto_gptq`.
TODO:

- [x] license notices
- [x] linting
- [x] some initial unit tests
- [x] add MLP fused-ops - addressed in #29
- [x] add in mixtral-specific rules - addressed in #29
- [x] fix FSDP casting issues introduced by #15 - addressed in #28
- [ ] fix formatting in configs generated by `generate_sample_configurations.py`
- [ ] update benchmarks (maybe do in a different PR)
- [x] support BNB QLoRA in a later PR - addressed in #29
- [ ] `ModelPatcher` rules for patching generic models (maybe defer to later)
Initial Tests on L40