Baseline 2024
Newly available
Intl.Segmenter.prototype.segment()
The segment() method segments a string according to the locale and granularity of this Intl.Segmenter object.
const string1 = "Que ma joie demeure";
const segmenterFrGrapheme = new Intl.Segmenter("fr", {
  granularity: "grapheme",
});
const graphemeSegments = segmenterFrGrapheme.segment(string1);
console.log(Array.from(graphemeSegments)[0]);
// Expected output:
// Object {segment: 'Q', index: 0, input: 'Que ma joie demeure'}
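As a further sketch of why grapheme granularity matters, the snippet below compares the number of UTF-16 code units with the number of grapheme segments for a string containing a combining accent. The variable names (`text`, `graphemeSegmenter`, `graphemes`) are illustrative and not part of the original page, and the example assumes a runtime that supports Intl.Segmenter.

```javascript
// Assumption: the runtime supports Intl.Segmenter (recent browsers or Node.js).
// "café" spelled with a plain "e" followed by a combining acute accent (U+0301).
const text = "cafe\u0301";

const graphemeSegmenter = new Intl.Segmenter("fr", { granularity: "grapheme" });

// segment() returns an iterable; keep only the text of each segment.
const graphemes = Array.from(graphemeSegmenter.segment(text), (s) => s.segment);

console.log(text.length); // 5 (UTF-16 code units)
console.log(graphemes.length); // 4 (user-perceived characters)
```

The last grapheme is two code units long ("e" plus the combining mark), which is why the two counts differ.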
Syntax
segment(input)
Parameters
input
The text to be segmented, as a string.
Return value
A new iterable Segments object containing the segments of the input string.
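The returned Segments object can be consumed in more than one way. The sketch below, which uses illustrative names (`wordSegmenter`, `sentence`, `result`) that do not appear in the original page, collects all segments with the spread syntax and then looks up a single segment with the `containing()` method of the Segments object.

```javascript
// Assumption: the runtime supports Intl.Segmenter.
const wordSegmenter = new Intl.Segmenter("en", { granularity: "word" });
const sentence = "Hello world";
const result = wordSegmenter.segment(sentence);

// The Segments object is iterable, so it can be spread into an array.
const all = [...result];
console.log(all.length); // 3: "Hello", " ", "world"

// containing(index) returns the data for the segment covering that code-unit index.
const hit = result.containing(7);
console.log(hit.segment, hit.index); // "world" 6
```

Each iteration over the Segments object starts from the beginning of the input, so the same object can be iterated again after the spread.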
// Create a locale-specific word segmenter
const segmenter = new Intl.Segmenter("fr", { granularity: "word" });
// Use it to get an iterator over the segments of a string
const input = "Moi ? N'est-ce pas ?";
const segments = segmenter.segment(input);
// Use that iterator to perform the segmentation
for (const { segment, index, isWordLike } of segments) {
  console.log(
    "segment at code units [%d, %d]: «%s»%s",
    index,
    index + segment.length,
    segment,
    isWordLike ? " (word-like)" : "",
  );
}
// logs
// segment at code units [0, 3]: «Moi» (word-like)
// segment at code units [3, 4]: « »
// segment at code units [4, 5]: «?»
// segment at code units [5, 6]: « »
// segment at code units [6, 11]: «N'est» (word-like)
// segment at code units [11, 12]: «-»
// segment at code units [12, 14]: «ce» (word-like)
// segment at code units [14, 15]: « »
// segment at code units [15, 18]: «pas» (word-like)
// segment at code units [18, 19]: « »
// segment at code units [19, 20]: «?»
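A common follow-up to the loop above is to keep only the word-like segments, for example to count words. The following is a minimal sketch under that assumption. The names `frWords`, `phrase`, and `words` are hypothetical, and the exact segmentation may depend on the Unicode data shipped with the runtime.

```javascript
// Assumption: the runtime supports Intl.Segmenter.
const frWords = new Intl.Segmenter("fr", { granularity: "word" });
const phrase = "Moi ? N'est-ce pas ?";

// isWordLike is only provided for "word" granularity; it is false for
// spaces and punctuation, so filtering on it leaves just the words.
const words = [...frWords.segment(phrase)]
  .filter((s) => s.isWordLike)
  .map((s) => s.segment);

// Given the log shown above, this is expected to contain
// "Moi", "N'est", "ce" and "pas".
console.log(words, words.length);
```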
仿§æ¸ ãã©ã¦ã¶ã¼ã®äºææ§