-
https://github.com/commonmark/commonmark.js/issues/261
https://github.com/commonmark/commonmark.js/pull/289
https://spec.commonmark.org/0.31.2/#paragraphs
> Leading spaces or tabs are skipped:
…
-
CodeTidy is set to omit embedded HTML from auto-indentation fixes. If there is a script tag within the HTML for javascript then syntax coloring treats this the same as embedded js causing inconsistent…
-
This Issue documents all know issues in the sticky scrolling feature with priorties.
### High
- [ ] [Sticky scrolling should allow to be fed by language-specific info/annotations](https://github.c…
-
As of now, the specification says that "both variables and constants can be named with any Unicode character or string."
It poses a problem because then, variables could be named with the *space* s…
-
Lexer uses Pattern_White_Space unicode property when skipping over trivia. However, when we process string literals with escaped newlines, we only skip ASCII whitespace:
https://github.com/rust-lan…
-
### Description
Large documents need to be chunked otherwise tokens exceeding the model's limit won't be used.
MVP for default word based chunking strategy:
- Use a sliding window approach
- Chunk i…
-
# Embedded Language Indicators for raw string literals
* [x] Proposed
* [ ] Prototype: Not Started
* [ ] Implementation: Not Started
* [ ] Specification: Not Started
## Summary
[summary]: …
-
I have found some interesting properties of the molecule language. I'm not suggesting that you should change the language in any way immediately, just sharing my findings and maybe provide some info i…
-
Could you consider removing the semicolons at the end of the lines or make them optional?
While adding semicolons can make the code appear more rigorous, for a scripting language, ease of use might …
-
### Before Asking 在提问之前
- [X] I have read the [README](https://github.com/alibaba/data-juicer/blob/main/README.md) carefully. 我已经仔细阅读了 [README](https://github.com/alibaba/data-juicer/blob/main/READ…