-
# Offline Alternative to Google's Read Along App in Hindi
## Description
Develop an offline application (POC - web) that can display a set of Hindi words and accurately determine if the user has p…
-
# Integration of Training Operator into Intel Cloud Native AI Pipeline
## Executive Summary
This proposal presents a strategic vision for enhancing Intel's Cloud Native AI Pipeline (CNAP), which…
-
# repo链接
https://github.com/THUDM/ChatGLM-6B
https://github.com/mymusise/ChatGLM-Tuning
https://github.com/LianjiaTech/BELLE
## LLM量化
https://zhuanlan.zhihu.com/p/616969812
- [SmoothQuant](htt…
-
# URL
- https://arxiv.org/abs/2306.09782
# Affiliations
- Kai Lv, N/A
- Yuqing Yang, N/A
- Tengxiao Liu, N/A
- Qinghui Gao, N/A
- Qipeng Guo, N/A
- Xipeng Qiu, N/A
# Abstract
- Large Lan…
-
Hi,
Thank you for the code release and the continuous support.
In the main paper (Sec 4.2.3) you mention FT experiments following the FT protocol in "Training
data-efficient image transformers & di…
-
[Cross-document Coreference Resolution over Predicted Mentions](https://aclanthology.org/2021.findings-acl.453.pdf)
=========
## Contribution summary
- Cattan et al. proposed the first end-to-end m…
-
# Motivation
Since ichigo v0.5 will support additional language that will make the traditional t2s obsolete. This is a good chance to introduce a t2s framework that we have full control over.
# Go…
-
When using the default settings of
`stochastic_gradient_descend_hyperparam_optimization(X_train, y_train, init_param_guess=np.array([1.0, 100.0]), max_stagnating_iterations=8,
…
-
# URL
- https://arxiv.org/abs/2411.03350
# Authors
- Fali Wang
- Zhiwei Zhang
- Xianren Zhang
- Zongyu Wu
- Tzuhao Mo
- Qiuhao Lu
- Wanjing Wang
- Rui Li
- Junjie Xu
- Xianfeng Tan…
-
Describe the bug
I am trying to finetune tiiuae/falcon-7b-instruct and I am getting this error.
`TypeError: where(): argument 'condition' (position 1) must be Tensor, not bool`
**To Reproduce**…