en User talk:LiliCharlie

Chữ nôm

(tagged all chữ nôm characters as language "vi-hani" so that enabled browsers (like mine) may use a Vietnamese font to render them (available fonts are: "Nom Na Tong" and "HAN NOM A" & "HAN NOM B")) (current)

Hi, interesting. Would you have time to explain this a bit on the Talk:Chữ nôm page ... sounds useful and might be notable content in the article itself? In ictu oculi (talk) 16:11, 2 April 2013 (UTC)

Many thanks for explanation. Btw there has been an edit to your edit here. Cheers In ictu oculi (talk) 01:01, 3 April 2013 (UTC)[reply]

The vi-nom template

It seems that vi-nom is identical to Nom Na Tong, and distinct from the default font:

羅吧固𧵑得𥪝𤄯𠊛与今骨 (default)

羅吧固𧵑得𥪝𤄯𠊛与今骨 vi-nom

羅吧固𧵑得𥪝𤄯𠊛与今骨 Nom Na Tong

Compare the default font to HAN NOM A and HAN NOM B:

羅吧固𧵑得𥪝𤄯𠊛与今骨 (default)

羅吧固𧵑得𥪝𤄯𠊛与今骨 HAN NOM A

羅吧固𧵑得𥪝𤄯𠊛与今骨 HAN NOM B

HAN NOM A has an extremely subtle variation on the third character. ~~HAN NOM B has similar variation on the eighth character.~~ But essentially these three fonts are identical. I suggest dropping both HAN NOM A and HAN NOM B from the vi-nom template. Do you think it would help to replace them with a font that covers CJK-C? Another issue is that the name vi-nom is quite confusing. It looks like a lang parameter, but it isn't. Could we change the name to "Nom" or "HanNom"? Kauffner (talk) 18:03, 6 April 2013 (UTC)[reply]

Of course we could change its name, but only if there’s good reason to do so. I chose the name in accordance with the names of similar templates for Chinese that are typically called like {{zh-zhuyin}}, {{zh-trad}} or {{zh-viet}}. IMO template users should read a template’s documentation before they use it. And vi-nom is not a defined locale that could be used with {{lang}} anyway.

As to the HAN NOM fonts: I don’t like them either, and their style seems to be a mere copy of mainland Chinese Sòngtǐ 宋体 fonts. However I am reluctant to just drop the HAN NOM fonts. We might insert other font family names between Nom Na Tong and those instead, e.g. Sun-ExtB and MingLiu_HKSCS-ExtB. And Andrew’s BabelStone Han contains 456 of the 4149 Extension C characters as well. --LiliCharlie 20:12, 6 April 2013 (UTC)

Although I have all those fonts installed, I can display the notorious 𫋙/càng character only with HanaMinB. In any case, the CJK-C characters are pretty obscure, likely to be used only in the context of "stuff recently added to Unicode". Take a look at Han unification. I used vi-nom to add Vietnamese characters to the charts. Kauffner (talk) 03:37, 7 April 2013 (UTC)[reply]

I have updated the list of fonts in the template {{vi-nom}}. Please keep me informed if this fixes your display problems. If not I might consider reverting to the old template. --LiliCharlie 15:58, 7 April 2013 (UTC)

I think the problem is that "Nom Na Tong" is at the head of the list in vi-nom, so if installed it is always used, even for CJK-C characters, which it does not cover. A possibility is to create a separate template for CJK-C and CJK-D characters that just lists fonts that support CJK-C and CJK-D, and use that for 𫋙 rather than vi-nom. BabelStone (talk) 16:56, 7 April 2013 (UTC)[reply]
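The split proposed above depends on knowing which characters fall in Extensions C and D. A minimal Python sketch of that check follows. The helper names are hypothetical; the ranges are the Unicode block boundaries for CJK Unified Ideographs Extension C (U+2A700 to U+2B73F) and Extension D (U+2B740 to U+2B81F).

```python
# Inclusive block boundaries from the Unicode Standard.
CJK_EXT_C = (0x2A700, 0x2B73F)
CJK_EXT_D = (0x2B740, 0x2B81F)


def in_block(ch, block):
    """Return True if the single character ch lies inside the given block."""
    lo, hi = block
    return lo <= ord(ch) <= hi


def needs_cjk_c_d_template(text):
    """Hypothetical helper: True if any character is in Extension C or D."""
    return any(
        in_block(ch, CJK_EXT_C) or in_block(ch, CJK_EXT_D) for ch in text
    )


# U+2B2D9 is the Extension C character discussed in this thread.
print(needs_cjk_c_d_template(chr(0x2B2D9)))
print(needs_cjk_c_d_template("羅吧固"))
```

Text for which the helper returns True would go into the separate template; everything else can stay in {{vi-nom}}.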

Sounds good. So the new template would only list a few SIP fonts. And once browser/OS support for CJK-C/D is sufficient, it could be redefined as {{vi-nom|{{{1}}}}} and thus become synonymous with {{vi-nom}}. Would {{vi-nom-C-D|...}} be an appropriate name for the new template, or is {{vi-nom-CJK-C-D|...}} a better choice? --LiliCharlie 17:43, 7 April 2013 (UTC)

The new template for Extensions C and D has been created. I have chosen to call it {{vi-nom-CJK-C-D}} and it has already been applied to the two occurrences of the character 𫋙 in the article Han-Nom. Please report if it works for you. --LiliCharlie 02:51, 8 April 2013 (UTC)

Thanks, that works for me (with HanaMinB installed on my computer). "HAN NOM B" (and HanNom-B?) only accidentally includes 106 CJK-C characters because it put a number of unencoded characters in the reserved code points at the end of the CJK-B block and these later became part of CJK-C with the result that HAN NOM B (and HanNom-B?) has completely the wrong glyphs for CJK-C characters U+2A700 through U+2A769. Therefore, HAN NOM B (and HanNom-B?) should be removed from the template list. BabelStone (talk) 10:59, 8 April 2013 (UTC)[reply]
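The figure of 106 follows directly from the range quoted above. A quick Python check, with a hypothetical helper name for testing whether a given character is one of the affected code points:

```python
# Range of Extension C code points that HAN NOM B maps to wrong glyphs,
# as stated in the comment above (inclusive on both ends).
FIRST, LAST = 0x2A700, 0x2A769

affected_count = LAST - FIRST + 1


def is_misencoded_in_han_nom_b(ch):
    """Hypothetical helper: True if ch is in the affected range."""
    return FIRST <= ord(ch) <= LAST


print(affected_count)
```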

Glad it works for you. I remember having noticed that HAN NOM B’s encoding is faulty, but that was several years ago, and since I had alternative fonts and never liked and used it I later forgot. — HANNOM-B is the font’s PostScript name. I know that only font family names should be used in CSS, and not PS names, but it’s an old habit of mine to list them too, as this won’t do any harm but might improve someone’s display... --LiliCharlie 20:51, 8 April 2013 (UTC)

Great job! It displays for me too. Perhaps you could shorten the name of the template. After all, this one isn't specific to Vietnam. It could be just CJK-C-D. Kauffner (talk) 14:30, 8 April 2013 (UTC)[reply]

Finally! Gives me the feeling I’ve been helpful. — {{vi-nom-CJK-C-D}} is language specific: Notice that I plan to make it synonymous with {{vi-nom}} when it is no longer needed (see above), and if an expanded version of Nom Na Tong or an equivalent font is available, it will come on top of the template’s font list. All CJKV fonts are locale specific. The Japanese wouldn’t want their character for zen displayed as 禅 instead of 禅 (with ㇔㇒ instead of ㇔㇔㇒), nor do Hong Kongers like to see their window character as 窗 for 窗 (with ㇒㇒㇔ instead of ㇒㇇㇔). If you don’t see the difference use the government-official standard fonts 華康標準宋體=DFSongStd from HK and 全字庫正宋體=TW-Sung from TW; for Kǎitǐ 楷體 style I recommend 華康香港標準楷書=DFHKStdKai-B5 and 全字庫正楷體=TW-Kai. — BTW, it’s great that the article Han unification now has Nôm in the comparative table. As my window example showed the Chinese examples should be given by country, as in the Chinese WP, and other information is misleading as well. I hope I’ll find the time to write a versatile JavaScript called Script Display Tester later this month, and then will take care of Han unification. I think I have a good knowledge of local differences in CJKV typography. --LiliCharlie 20:51, 8 April 2013 (UTC)

For many of the examples on the chart, the differences are quite trivial. We could remove some of them and focus on those where the differences are more significant. The biggest difference is between the mainland Chinese font and the Taiwanese font. Of course, that's Cold War politics rather than language. One or the other must have decided to create a distinct font so you can use typography to express political loyalty. If we could somehow add a font from the 1920s to the chart, we might have to reinterpret what it means to use a "traditional" font. Kauffner (talk) 10:22, 9 April 2013 (UTC)

Well, Singapore has copied mainland China’s typographical manners for decades, and printed matter from Hong Kong as well as from Malaysia more and more resembles what comes from Běijīng and Shànghǎi, too, so this matter is no longer a battle that keeps the Cold War alive. Until ten years ago Taiwanese officials persecuted anyone who dared to publish in simplified characters. That was orthographic Cold War! And that atmosphere certainly had a strong influence on the island’s font scene and Taiwanese art as a whole. Fortunately its rulers have come to reason, and even sanctioned Hànyǔ pīnyīn. Things have become much more relaxed and a little more mixed, and there are now web sites on the mainland like the Buddhist Homeland Shrouded in Mists ;-) that are entirely in traditional characters. To be sure, all this is not intended as a comment on the nature and quality of countries’ political systems, let alone their peoples.

Sometimes the difference between traditional and simplified characters is even more complicated. 薴 níng ‘limonene’ has the simplified form 苧; however 苧 zhù ‘a type of grass’ is a traditional character which has the simplified form 苎. In this case wrong or no use of zh-hant and zh-hans not only causes an unwanted display, but may have strange effects on translation software, conversion software, etc. Try the conversion of 苧 in both directions with BabelStone’s BabelPad. This is a nice example of the effects of Unicode’s Han unification policy and of the necessity—not only for display—to use zh-hant vs. zh-hans markup for Chinese.
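The 苧 case can be sketched in a few lines of Python. The two lookup tables contain only the pairs mentioned above, and the function names and the use of zh-hant / zh-hans as a plain string argument are assumptions made for illustration; this is not how any particular converter is implemented.

```python
# Only the pairs named in the comment above; real tables are far larger.
SIMPLIFIED_OF = {"薴": "苧", "苧": "苎"}
TRADITIONAL_OF = {"苧": "薴", "苎": "苧"}


def to_simplified(ch, lang):
    """Convert ch to simplified form; text tagged zh-hans is left alone."""
    if lang == "zh-hans":
        return ch
    return SIMPLIFIED_OF.get(ch, ch)


def to_traditional(ch, lang):
    """Convert ch to traditional form; text tagged zh-hant is left alone."""
    if lang == "zh-hant":
        return ch
    return TRADITIONAL_OF.get(ch, ch)


# The same code point gives different results depending on its tag.
print(to_simplified("苧", "zh-hant"), to_simplified("苧", "zh-hans"))
print(to_traditional("苧", "zh-hans"), to_traditional("苧", "zh-hant"))
```

Without the language tag a converter has to guess which reading of 苧 is meant, which is the ambiguity described above.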

If you are seriously interested in the matter of font display we could try font embedding. For a start I could produce a WOFF font that contains just the two glyphs for 𫋙 and ⿰朝乙, so the font file the browser automatically downloads would be tiny. As you know I already have ⿰朝乙 in SVG format which is easy to convert. This technology has several advantages: it is now supported by all major browsers, so you can be way over 90% sure readers view the characters as intended, and it is a real vector font that doesn’t look as ugly as a magnified pixel image, and matches in size with the other characters. I’m not sure if Wikimedia supports uploading web fonts though, but if you are interested I will check. --LiliCharlie 21:53, 9 April 2013 (UTC)

If you want to try, that's certainly fine with me. I was thinking the giàu character might look nicer if we photoshopped the background. But otherwise I'm satisfied with how it looks now. I put the article up for DYK, but it was rejected. It seems to me the writing is well above DYK standard. I'm sure it was IIO's carping that killed the nomination. Guy stalks me everywhere, a vindictive Frenchman with too much time on his hands. Kauffner (talk) 09:27, 10 April 2013 (UTC)[reply]

It was rather rude to reject your DYK nomination without any explanation, but I think the reason would be that the article has to be nominated within 5 days of creation (DYK rules), and it was created on 9th March but only nominated on 19th March (sometimes a little leeway is allowed, but 10 days is too long). Wrt web fonts, I think that might be a good idea, but probably would need wider discussion amongst the community. If we could upload suitably licensed web fonts to Commons and use them to display characters not generally catered for out of the box that would be a great improvement for articles which use obscure scripts and characters. BabelStone (talk) 10:36, 10 April 2013 (UTC)[reply]

Nom language warning

Do you know that we already have {{Contains Vietnamese text}}? Kauffner (talk) 15:55, 6 April 2013 (UTC)[reply]

Thanks for the hint, Kauffner. I’m not sure if {{Contains Vietnamese text}} should be changed to tell the user to install Nôm fonts, and delete the not yet functional template {{Contains Nom text}} after it has been replaced by {{Contains Vietnamese text}} on all pages, or if it’s better to keep {{Contains Vietnamese text}} as a general browser support warning and write {{Contains Nom text}} as a font specific template. What do you think? Do we need separate templates for these two things? --LiliCharlie 19:35, 6 April 2013 (UTC)

The Vietnamese alphabet displays for Windows 95 and later. It is only Han-Nom that is an issue. So there is no need for two templates. Kauffner (talk) 19:10, 8 April 2013 (UTC)[reply]

Web Fonts

I've put your test web fonts on my website for testing at http://www.babelstone.co.uk/Fonts/NomWebExtension.ttf and http://www.babelstone.co.uk/Fonts/NomWebExtension.woff as requested (the woff file is there but I can't get any of my browsers to admit that it is there). I'm afraid that I'm not too sure where you would engage the greater community in discussions on the use of web fonts as I tend not to get involved in discussions on policy, etc. Probably best to raise it at WP:TECHPUMP. BabelStone (talk) 22:56, 10 April 2013 (UTC)[reply]

Thank you for hosting my fonts. As your website doesn’t seem to recognize files in WOFF format I have now hosted NomWebExtension.woff at http://typefront.com where it is available for embedding in en.WP at http://typefront.com/fonts/825591377.woff.

This is a first test with embedded fonts. The first and last characters 雨 and 没 are only for reference; the two in the middle are our notorious ⿰虫強 at u+2B2D9 (CJK Unified Ideographs Extension C) and ⿰朝乙 at u+F8000 (Supplementary Private Use Area-A). The first line is an attempt at embedding the WOFF font from typefront.com and in the second line I try to embed the corresponding TTF from babelstone.co.uk.

雨𫋙󸀀没

WP obviously doesn’t allow my style='@font-face {...}' definitions and overwrites them with style="/* insecure input */". This test has failed. --LiliCharlie 01:08, 12 April 2013 (UTC)
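For reference, the kind of rule being attempted can be sketched by generating it as text in Python. The font-family name "NomWebExtension" is an assumption taken from the file name, the function name is hypothetical, and the two URLs are the ones quoted above. Note that @font-face is a CSS at-rule that belongs in a stylesheet rather than in an inline style attribute, so it would probably not have taken effect there even without the sanitizer.

```python
def font_face_rule(family, sources, unicode_range):
    """Build the text of a CSS @font-face rule (illustrative only)."""
    src = ",\n       ".join(
        f'url("{url}") format("{fmt}")' for url, fmt in sources
    )
    return (
        "@font-face {\n"
        f'  font-family: "{family}";\n'
        f"  src: {src};\n"
        f"  unicode-range: {unicode_range};\n"
        "}"
    )


rule = font_face_rule(
    "NomWebExtension",  # assumed family name, taken from the file name
    [
        ("http://typefront.com/fonts/825591377.woff", "woff"),
        ("http://www.babelstone.co.uk/Fonts/NomWebExtension.ttf", "truetype"),
    ],
    "U+2B2D9, U+F8000",  # the two test code points named above
)
print(rule)
```

The unicode-range descriptor restricts the embedded font to the two test characters, so the reference characters would still come from locally installed fonts.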

Interesting, but perhaps not that surprising. It seems that you would have to get changes made to the MediaWiki software to support web fonts, which I can't imagine would be easy, especially if people have concerns about potential misuse of web fonts for nefarious purposes. BabelStone (talk) 01:49, 12 April 2013 (UTC)[reply]

I don’t dare start a discussion on this topic. First, I am a lover of writing systems, typography and calligraphy rather than a programmer with a highly technical background. And what’s more English is neither my first nor my second language, so I fear that talks might break down because of insufficient linguistic and/or cultural competence on my part. It’s a pity, for so many Wikipedia articles could be drastically improved if only nearly universally supported modern font technology were also supported by MediaWiki and Wikimedia Commons. If you look at HTML/CSS code you will discover that a surprisingly high number of webpages “already” rely on font embedding (i.e. 15 years after its first employment), and as far as I’m aware it works without any security issues that are worth mentioning.

Font trivia

This document (in Vietnamese) says that the Nom Na Tong font is based on characters found in the 1933 edition of a book called Thiền Tông Bản Hạnh (The Origin of Buddhist Meditation) by Thanh Tu Thich. Kauffner (talk) 16:25, 15 April 2013 (UTC)[reply]

Buddhist monasteries all over the Sinosphere are a great source for countless CJKV characters. The monks and nuns kept inventing 漢字 in an effort to render expressions they found in Buddhist texts written in a large number of foreign languages, or to convey the uniqueness and unspeakableness of their mystical experiences. I’m sure that many more characters from Buddhist texts will be added to Unicode over time, but diligent systematic studies are necessary before further proposals can be submitted to the IRG. --LiliCharlie 03:33, 17 April 2013 (UTC)

Another Nom template

There is yet another language template for Vietnamese you might want to check out: {{vie}}. On another issue, I adjusted the {{vi-nom}} template and it now seems to work for CJK-C and CJK-D characters as well. Kauffner (talk) 20:52, 8 May 2013 (UTC)[reply]

Thank you for pointing to {{vie}}. To my mind the template could be improved by adding HanaMinA so there is less mixing of different fonts. For example the (Japanese) character u+8217 舗 of the documentation is displayed with PMingLiU although I have Nom Na Tong and all fonts of the HanaMin series installed. Also 城舗胡志明 (with the said Japanese character) for Thành phố Hồ Chí Minh of the template documentation looks strange to me. Shouldn’t that be written 城舖胡志明?
The {{vi-nom}} template had always worked on my system, and {{vi-nom-CJK-C-D}} was created because you had issues with {{vi-nom}} on yours. I have no means of testing if {{vi-nom}} now works on all platforms. If you are not 100% sure {{vi-nom}} works with Extensions C and D on all systems I still recommend using {{vi-nom-CJK-C-D}}. LiliCharlie 06:29, 9 May 2013 (UTC)

It looks like {{vie}} was copied from Japanese Wiki and 城舗胡志明 is "Ho Chi Minh City" in Japanese. This is a bad example since it is a modern name that was never written in Han or Nom -- unless you count Wiki zh-classical, which gives "胡志明市". I copied the font list from {{vi-nom}} to {{vie}}, so the two templates should have the same output now. The use of a separate Vietnam-oriented template for recently added Unicode characters is a kludge, so IMO {{vi-nom}} is the better solution. The {{vi-nom-CJK-C-D}} template can be renamed and presented as a solution for displaying the CJK-C and CJK-D characters in general, since this problem is not specific to Vietnamese.

On an unrelated issue: Check out the charts at Han unification. I made some changes recently. Kauffner (talk) 09:08, 9 May 2013 (UTC)

Yes, I felt that {{vi-nom-CJK-C-D}} was a hack from the very start, and this was the reason I wrote that I wished it to become synonymous with {{vi-nom}} some day. Yet it seems to have fixed your display problems at one time. (I remember BabelStone’s remark that in a character string an Extension B/C character was not displayed for you because the font on your system was already determined by the surrounding characters, or something to this effect. Please keep in mind that an isolated character may look different from the very same character within a sequence of characters.)

I closely watched any changes to Han unification and I appreciate your edits very much. Obvious, though not serious, errors are: the code points for u+9913 餓 and u+997F 饿 are reversed and u+7985 禅/禅 has a traditional character equivalent that is not mentioned, u+79AA 禪. These shortcomings are not fundamental though. I fail to understand the last sentence of the introduction to the examples: Why on earth is there mention of “non-graphical language tag characters ... for plain text language tagging” if using these tag characters is strongly discouraged?

I am in the process of creating SVG images for all the characters in the tables of Han unification and its equivalents in three or four other WP languages. Whenever available, images for the six Unicode CJKV chart (or source) locales will be produced.

Generally speaking I think that the article Han unification (which is about how the Unicode Consortium decided to encode characters) might be improved by closely following the Unicode specifications. Illustrative sample glyphs should be given for the same locales as are given in the Unicode charts (China, Hong Kong, Taiwan, Japan, Korea, Viet Nam), irrespective of language or the traditional vs. simplified character distinction. (N.B.: for the sample characters in the two tables of Han unification I can show you that the government official reference fonts of HK (DFSongStd = 華康標準宋體) and TW (TW-Sung = 全字庫正宋體) have clearly different glyphs [at first sight] for at least 10 characters, even though the fonts that ship with Mr Gates’s Windows are somewhat less divergent.) — To closely follow the Unicode standard also means to talk about semantic variants etc. Please be—or get—prepared to talk about how Unicode handles CJKV characters (which is not the same way as they are traditionally treated in East or West Eurasia).

As I am neither a native nor second language speaker of English I am reluctant to make changes to en.WP. Please go ahead and make the changes I don’t dare make. Together we are strong. LiliCharlie 00:02, 10 May 2013 (UTC)

I fixed the mistakes you mentioned at the top of the post, but I don't really understand what you are proposing lower down. What characters do you want to use? There are 26 characters on the "language dependent" chart now, which I think is too many. When the differences are slight, there is no great loss in removing a row altogether. Traditional vs simplified is not a true language dependency anyway. A Japanese vs. Chinese variation is likely to be of wider interest. So I think we could boil it down to 10 or 15 rows. I like the "Zen" character issue you mentioned before. We should be able to do some more with that. That's certainly a better talking point than the "grass" character that gets brought up so much.
I too have an ambitious font project: to get the {{lang-vi}} template to apply the best available fonts for Vietnamese. The first step is a font comparison, which I am doing here. Kauffner (talk) 15:00, 10 May 2013 (UTC)[reply]

That’s right, there is no need for a larger number of sample characters. And yes, 禪/禅/禅 is a nice one that can be elaborated on.
The reason I insist on the zh-HK vs. zh-TW distinction is that some characters that are dis-unified in TW and hence in Unicode get re-unified in HK, in a way of speaking. This is the reverse of what happened to 禅/禅. — To get an impression please go to http://glyph.iso10646hk.net/english/download_001.jsp and download the official Hong Kong ISO 10646 reference font DFSongStd/華康標準宋體, and to http://www.cns11643.gov.tw/AIDB/download.do?name=字型下載 and download the official Taiwanese (general) reference font TW-Sung/全字庫正宋體. Then install them and make them (temporarily or permanently) the fonts your browser uses to display text in the zh-HK and zh-TW locales and then have a look at these characters that are already in the tables of the Han unification article:

兌兑稅税 (zh-HK)

兌兑稅税 (zh-TW)

Actually Hong Kong officials seem to refuse to make the BIG-5 distinction that entered into the Unicode standard but was not judged to be a case of compatibility characters for pre-existing encodings by the IRG, because it doesn’t make sense to the experts from Hong Kong (nor to me, BTW) and because their typographic tradition is much closer to that of Mainland China (and maybe also influenced by the comparatively strong Japanese speaking minority of HK, though this seems less important in this context). To be sure, what I like to call character “re-unification” is more than just a matter of using different glyphs. Rather it’s a silent but official way of saying: “You BIG-5 encoders and Unicoders have overdone the Han dis-unification thing.”

When I said I wanted to closely follow the Unicode Standard I meant that all Unicode Han Database “fields” that have become necessary by the way the Unicode Consortium decided to handle Han characters deserve ~~mention~~ consideration. At this point the complete list of UniHan fields is: kAccountingNumeric, kBigFive, kCangjie, kCantonese, kCCCII, kCheungBauer, kCheungBauerIndex, kCihaiT, kCNS1986, kCNS1992, kCompatibilityVariant, kCowles, kDaeJaweon, kDefinition, kEACC, kFenn, kFennIndex, kFourCornerCode, kFrequency, kGB0, kGB1, kGB3, kGB5, kGB7, kGB8, kGradeLevel, kGSR, kHangul, kHanYu, kHanyuPinlu, kHanyuPinyin, kHDZRadBreak, kHKGlyph, kHKSCS, kIBMJapan, kIICore, kIRG_GSource, kIRG_HSource, kIRG_JSource, kIRG_KPSource, kIRG_KSource, kIRG_MSource, kIRG_TSource, kIRG_USource, kIRG_VSource, kIRGDaeJaweon, kIRGDaiKanwaZiten, kIRGHanyuDaZidian, kIRGKangXi, kJapaneseKun, kJapaneseOn, kJis0, kJis1, kJIS0213, kKangXi, kKarlgren, kKorean, kKPS0, kKPS1, kKSC0, kKSC1, kLau, kMainlandTelegraph, kMandarin, kMatthews, kMeyerWempe, kMorohashi, kNelson, kOtherNumeric, kPhonetic, kPrimaryNumeric, kPseudoGB1, kRSAdobe_Japan1_6, kRSJapanese, kRSKangXi, kRSKanWa, kRSKorean, kRSUnicode, kSBGY, kSemanticVariant, kSimplifiedVariant, kSpecializedSemanticVariant, kTaiwanTelegraph, kTang, kTotalStrokes, kTraditionalVariant, kVietnamese, kXerox, kXHC1983, and kZVariant.

More especially none of the fields that have “Variant” or “Source” in their names can be omitted if we are really talking about Unicode/UniHan, and not Han characters in general. — Rome wasn’t built in a day, and the article can only be developed by and by, too. Han characters are a vast (actually an open-ended) field of research and contemplation.
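Picking out those fields is a simple substring filter. The Python sketch below uses only a subset of the field names listed above, and the function name is hypothetical.

```python
# A subset of the UniHan field names listed above; the full list is longer.
UNIHAN_FIELDS = [
    "kBigFive", "kCantonese", "kCompatibilityVariant", "kDefinition",
    "kIRG_GSource", "kIRG_HSource", "kIRG_JSource", "kIRG_KPSource",
    "kIRG_KSource", "kIRG_MSource", "kIRG_TSource", "kIRG_USource",
    "kIRG_VSource", "kMandarin", "kSemanticVariant", "kSimplifiedVariant",
    "kSpecializedSemanticVariant", "kTotalStrokes", "kTraditionalVariant",
    "kVietnamese", "kZVariant",
]


def variant_and_source_fields(fields):
    """Keep only the fields whose names contain 'Variant' or 'Source'."""
    return [name for name in fields if "Variant" in name or "Source" in name]


selected = variant_and_source_fields(UNIHAN_FIELDS)
print(len(selected), selected)
```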

For your work on the lang-vi template make sure you don’t forget the fonts that ship with Mac OS, and maybe Linux and Android as well. Richard Ishida has created an overview of non-Latin-Cyrillic-Greek Windows and Mac OS fonts by script, and he updates the list when new versions of these two OS’s are launched. The links given in the notes section at the bottom of his page might serve you as a starting point for finding Vietnamese Mac OS fonts. If you have no access to a Mac/Linux/Android OS you could try to ask a friend/someone to send you screenshots or allow you to use their computer for a couple of minutes. LiliCharlie 09:36, 11 May 2013 (UTC)

No, I haven't been purged or anything like that, but thanks for your concern. I took a trip to the delta for a few days. I come back to find my user page missing and the focus of considerable uproar.

Anyway, let's get back to fonts. I downloaded both the Hong Kong and Taiwanese fonts you mentioned above and I added them to the comparison chart in my sandbox. TW-Sung looks all messed up on Google Chrome, so I would advise against its use on Wikipedia. They throw Unicode and Big5 together, so the font has 113,000 character codes -- the opposite of Han unification.

I didn't notice any display problems with DFSongStd, the HK standard font. In the majority of cases, the HK character is identical to the one in the Taiwanese font. In some cases, DFSongStd uses a hybrid of the mainland and Taiwanese character. On Wiki, zh-HK yields the same font as zh-TW. Has HK always had its own fonts? There was no political motive to create a distinct local font under the British, so why would anyone bother? In any case, they obviously have one now. User:Rjanag maintains the {{lang-zh}} template, so we can ask him to make appropriate adjustments.

With the fields, I was thinking that we could have a box on the right that gives three or four of the fields for each character. The fields related to nationality strike me as the most relevant. For example, the reader may interpret a row differently depending on whether the character has a Japanese, Korean or Vietnamese field.

{{lang-vi}} should probably make the font look like it does on a Vietnamese site, with Arial or Times New Roman for the alphabetic script and Nom Na Tong for the Han characters.

Do you know what character is being shown on this page? I checked 9.7, but I did not find it. Kauffner (talk) 08:19, 15 May 2013 (UTC)[reply]

The character you were looking for in vain is U+51B7 冷. It’s quite a common one meaning cold (literally and figuratively). Unicode has also encoded a corresponding kZVariant/compatibility character U+F92E.

As regards fonts Hong Kong and Taiwan have different needs since people have to be able to write their own different languages that require different characters — Taiwanese Mǐn Nán vs. Cantonese Yuè. Moreover there are special characters in use for place names and proper names that don’t occur elsewhere. Never forget that even tiny Macao needed 16 characters of their own to be encoded in Unicode, and Singapore contributed 226 own characters. — The names issue sometimes has strange effects: For decades even smaller Mainland Chinese character dictionaries showed the “Japanese-only” character 畑 and defined it as a character used as a Japanese family name or similar. The reason was that Chairman Máo had once met a Japanese who used that character in his name... — For such character borrowings UniHan seems a perfect solution to me.

It should be noted that UniHan unifies characters used 1. in different eras 2. at different places 3. for different languages. — FYI, valid ISO 639-3 codes for Chinese languages are: cdo: Min Dong Chinese; cjy: Jinyu Chinese; cmn: Mandarin Chinese; cpx: Pu-Xian Chinese; czh: Huizhou Chinese; czo: Min Zhong Chinese; gan: Gan Chinese; hak: Hakka Chinese; hsn: Xiang Chinese; ltc: Late Middle Chinese; lzh: Literary Chinese; mnp: Min Bei Chinese; nan: Min Nan Chinese (includes Taiwanese); och: Old Chinese; wuu: Wu Chinese (includes Shanghainese); yue: Yue Chinese (includes Cantonese); zho: Chinese (= ISO 639-1 zh) [macrolanguage that includes cdo, cjy, cmn, cpx, czh, czo, gan, hak, hsn, mnp, nan, wuu, yue]. All these require their own set of characters and may be used at the start of locale codes. — There are also ISO 639-3 codes for older forms of Japanese and Korean, but strangely only vie = ISO 639-1 vi for Vietnamese. I have no idea why.

I very much appreciate your efforts to solve display problems and even show the existing differences between Hong Kong and Taiwanese typographic traditions on your fonts page.

I am almost done with my UniHan SVGs. I have decided to show the characters exactly as they are seen in the Unicode character code chart PDFs (only much enlarged), for up to six IRG sources. I hope I can make a preview page with probably 107 SVGs for 206 Unicode characters available soon. — How do you judge the following sentence linguistically and in content: “The reproduction for purely scientific and informational non-profit purposes of this minimal proportion of the Unicode Character Code Charts which are copyrighted by US based Unicode, Inc. falls within the fair use doctrine of United States copyright law and the fair dealing doctrine and similar doctrines of limitations and exceptions to copyright of other jurisdictions.”? LiliCharlie 23:39, 24 May 2013 (UTC)

Another extension E character

I noticed that the character for phở, the famous Vietnamese soup, is in Extension E. It is 񣂝 (⿰米頗) and is No. 06234 in the proposal. Perhaps you could make another svg? Kauffner (talk) 04:43, 23 May 2013 (UTC)[reply]

Nom Character V04-5055.svg

LiliCharlie 23:39, 24 May 2013 (UTC)

		The Original Barnstar
		That's quite a character! Kauffner (talk) 12:20, 25 May 2013 (UTC)[reply]

I put this thing into the Han-Nom chart already, and my friendly stalker put it into the pho article,[1] which was disconcerting but saves me the trouble of having to do it myself. Kauffner (talk) 12:20, 25 May 2013 (UTC)[reply]

Han-Nom

Someone is trying to get rid of my Han-Nom article again, as you can see here. Kauffner (talk) 05:45, 24 May 2013 (UTC)[reply]

If Wikipedia is to be encyclopaedic the article must stay. LiliCharlie 23:39, 24 May 2013 (UTC)

They could get rid of all the Vietnamese terminology that we've been using for years: Han-Nom, chu Han, Han tu, etc. Then Japan would have "kanji", Korea would have "hanja", but Vietnam would have "Chinese characters." I don't know what to tell them. Kauffner (talk) 15:31, 25 May 2013 (UTC)[reply]

Copyedit

How do you judge the following sentence linguistically and in content: “The reproduction for purely scientific and informational non-profit purposes of this minimal proportion of the Unicode Character Code Charts which are copyrighted by US based Unicode, Inc. falls within the fair use doctrine of United States copyright law and the fair dealing doctrine and similar doctrines of limitations and exceptions to copyright of other jurisdictions.”

I count 56 words, so that's a really long sentence. I'll break it down: "This is a minimal portion of the Unicode Character Code Charts, which are copyrighted by US-based Unicode, Inc. Reproduction for scientific and other non-profit purposes falls within the fair use doctrine of United States copyright law, as well as the equivalent doctrines of other jurisdictions.” Kauffner (talk) 12:41, 25 May 2013 (UTC)[reply]

Another character dilemma

So what do you make of this character: . I got it from Le Chieu Thong, an 18th century king. Is it close enough to 謙 / khiêm or some other character so we can encode it? I found a manuscript of the primary source, as well as a transcription. Where I would expect to see this character, the transcription has a notation that says "維農貢縣古定社人也". Any idea what that means? Perhaps it says, "This king hated historians, so he used a character in his name that has made it very difficult for anyone to write about him." ☺ (This is a Unicode smiley). Kauffner (talk) 14:07, 25 May 2013 (UTC)

vi-nom on template

Should {{vi-nom}} be used for Han-Nom on templates?[2] Kauffner (talk) 07:21, 8 June 2013 (UTC)[reply]

Han-Nom

Hi, regarding this edit, can I ask whether you were contacted by email by User Kauffner prior to this edit?

(diff | hist) . . Han-Nom‎; 05:01 . . (+31,522)‎ . . ‎LiliCharlie (talk | contribs)‎ (Undid revision 563384239 by Gaijin42 (talk) — en.WP (being WP’s most international edition) should not make less distinctions than vi.WP and zh.WP (being the WPs most involved in this subject))

The reason for asking, no reflection on your good self, is that the merger-discussion was a public space discussion. Cheers. In ictu oculi (talk) 05:43, 9 July 2013 (UTC)[reply]

Saying nothing about the vi.wp article (since vi.wp didn't have one, it was translated from English at the request of User:Kauffner) but looking at zh:汉喃, what does that really say? That 汉 is 汉, and 喃 is 喃. It isn't much of an article, is it? And in any case the issue is English. But anyway, the main question is whether you were contacted by email prior to the edit above. In ictu oculi (talk) 07:02, 9 July 2013 (UTC)[reply]

No, there hasn’t been any contact. NEVER. If you don’t believe me, go and ask the NSA. — Thank you for denying I am capable of making my own decisions. And for implicitly accusing Kauffner, too. Cheers. LiliCharlie 07:26, 9 July 2013 (UTC)

Okay, sorry to have had to ask, but you can understand that, with the edit warring going on, having someone who didn't take part in the merge discussion suddenly make the same edit raises an eyebrow. It's a reflection on the problem, not on your good self. Cheers. In ictu oculi (talk) 08:26, 9 July 2013 (UTC)[reply]

I didn’t take part in that discussion because I am an inexperienced Wikipedian who was uncertain about the rules of the game. You will have observed, however, that I contributed to Kauffner’s ambitious article now and then. And if my memory serves me well it was YOU who wrote on the article’s talk page that my comment on the alleged impossibility of teaching Hán-Nôm should be preserved after the merger. Where has it gone? LiliCharlie 09:07, 9 July 2013 (UTC)

Sorry, I can't recall the comment, nor did I implement the merge. Off the cuff I can't see why teaching either classical Chinese or demotic Vietnamese would be impossible, but the issue would be whether there is a printed source saying that teaching classical Chinese is impossible, or a printed source saying that teaching Vietnamese demotic script is impossible. If there's a printed source for either of these statements then they could be added to the existing articles. In ictu oculi (talk) 09:16, 9 July 2013 (UTC)[reply]

I’ve found it; it’s here. Of course any script (i.e. writing system as defined in Peter T. DANIELS & William BRIGHT (1996): The World’s Writing Systems) can be written, read and therefore taught. LiliCharlie 12:11, 9 July 2013 (UTC)

My link to the still existing Han Nôm talkpage doesn’t seem to work; it gets redirected to Chữ nôm. Is it possible to redirect an article but not its talkpage? LiliCharlie 12:30, 9 July 2013 (UTC)

The article is a redirect to Chữ nôm, but the talk page is still there under its old name. It's linked from the merge box at the top of talk:Chữ nôm. As for the sentence you were challenging, after you tagged it as needing a citation, Kauffner removed it, and that wasn't changed when it was merged to Chữ nôm.

As for being new to Wikipedia, sincere discussion is always welcome, e.g. at talk:Chữ nôm. Kanguole 12:43, 9 July 2013 (UTC)[reply]

Your recent edits

Hello and welcome to Wikipedia. When you add content to talk pages and Wikipedia pages that have open discussion (but never when editing articles), please be sure to sign your posts. There are two ways to do this. Either:

Add four tildes ( ~~~~ ) at the end of your comment; or
With the cursor positioned at the end of your comment, click on the signature button located above the edit window.

This will automatically insert a signature with your username or IP address and the time you posted the comment. This information is necessary to allow other editors to easily see who wrote what and when.

Thank you. --SineBot (talk) 16:27, 28 September 2013 (UTC)[reply]

Received pron. section

Hi, thanks for your advice on how to edit the sample RP material into the Received Pronunciation article. I'm afraid I got waylaid by some urgent matters and have only now got going on this. I seem to have taken a few wrong turns wrt your advice, but have a sort of working paragraph now in my sandbox. The audio file should not, I'm sure, produce the screen display that comes up when you click on the link, and I'm not sure the transcriptions are shown in the right place. If you have the time to take a look I'd be grateful. RoachPeter (talk) 12:07, 8 December 2013 (UTC)[reply]

Thanks again for your help. I think it works OK now. RoachPeter (talk) 17:43, 8 December 2013 (UTC)[reply]

December 2013

Thank you for contributing to Wikipedia. We always appreciate when users upload new images. However, it appears that one or more of the images you have recently uploaded or added to an article, specifically User talk:RoachPeter/sandbox, may fail our non-free image policy. Most often, this involves editors uploading or using a copyrighted image of a living person. For other possible reasons, please read up on our Non-free image criteria. If you have any questions please ask them at the Media copyright questions page. Werieth (talk) 21:54, 31 December 2013 (UTC)[reply]

Thanks for help on RP

Hello, thanks for intervening to help in the matter of copyright status of material on Received Pronunciation. I have left a message for you and Werieth (well, it's more of a cry for help, actually!) on my own Talk page. RoachPeter (talk) 15:32, 1 January 2014 (UTC)[reply]

pêche

Hello, is this pronunciation [pɛːʃ] or [paɪ̯ʃ]? 166.48.172.215 (talk) 12:02, 8 January 2014 (UTC)[reply]

What I hear is a diphthong of [aɪ̯] type. (Historically in French there was a rule /aj/→/ɛ/, which seems “reversed” here.) This is a Quebec speaker, isn't it? —LiliCharlie (talk) 16:36, 8 January 2014 (UTC)[reply]

Is this one [tãpeɪ̯t] or [tãpaɪ̯t]? 166.48.188.247 (talk) 11:51, 9 January 2014 (UTC)[reply]

If those are my choices I’d say it’s closer to [aɪ̯] than to [eɪ̯]. Do you know this concise description of the phonetics of French? (The section “L'accent canadien (Québec)” starts on page 28 and the one called “Variantes du canadien” on p. 38; have a look at the illustrations of the vowels and compare them to one another and to other accents. More material on this so-called “natural phonetics” by Luciano Canepari is available for download here.) LiliCharlie (talk) 12:41, 9 January 2014 (UTC)[reply]

With this one it's hard to tell whether it's [ivaɛ̯ʁ] or [ivæːʁ]. 166.48.189.78 (talk) 22:28, 9 January 2014 (UTC)[reply]

I hear a centring vowel, not one ending in a front [ɛ̯]. — Why do you write [ʁ] instead of voiceless [χ]? (Cf. your transcription of /-r/ in secondaire.) LiliCharlie (talk) 09:31, 10 January 2014 (UTC)[reply]

Because [ʁ] and [χ] are similar. Maybe tempête sounds like [tãpɐɪ̯t]? 166.48.185.206 (talk) 01:41, 11 January 2014 (UTC)[reply]

Yes. Please note that if you want narrow “impressionistic” transcriptions your transcribers must all be trained in the same tradition, otherwise they are bound to diverge wildly. I want you to create a user account and discuss matters on your own talk page. And if your English is not sufficient for taking part in such a discussion, we can talk things over on your talk page on the French Wikipedia. (My mother tongue is neither Molière's nor Shakespeare's, so forgive me a few small mistakes.) LiliCharlie (talk) 06:26, 11 January 2014 (UTC)[reply]

This is my user account. Is this pronunciation [tɛɪ̯t] or [taɪ̯t]? Fort123 (talk) 16:25, 12 January 2014 (UTC)[reply]

Is this one [ɑ̃saɛ̯tχ] or [ɑ̃səɛ̯tχ]? Fort123 (talk) 02:17, 14 January 2014 (UTC)[reply]

I think that it's [ɑ̃sɐɛ̯tχ]. Fort123 (talk) 14:31, 20 January 2014 (UTC)[reply]

A barnstar for you!

	The Original Barnstar
	Thank you for adding the phonetic transcription to my Wikipedia entry! Bernd J. Kröger Bkroeger (talk) 11:34, 19 January 2014 (UTC)[reply]

Oh, how flattering! Thanks for the compliment. LiliCharlie (talk) 11:49, 19 January 2014 (UTC)[reply]

You're welcome! Your transcription was entirely correct. As a North German, I do indeed vocalize the /r/ in Bernd.

I think that is also correct for Standard German, though. As for the J.: it is in any case the correct transcription in Standard German. It is indeed spoken less often in German. I find the J. very important for me as a scientist, since it makes me easier to distinguish from other Bernd Krögers. Thanks again for your transcription. Bernd Bkroeger (talk) 20:38, 19 January 2014 (UTC)[reply]

quoted you, so...

I'm gonna get accused of polling maybe, but I quote you in this CfD. I think your comments about national sentiment and expertise and so on were useful; but whether people listen to me or not because I pick apart their faulty logics/perceptions/information at length is dubious; I'm made the issue, as usual, rather than the problem created by someone else who refuses to acknowledge they caused a problem..... it's a long story, and I come off as the bad guy for criticizing someone else's very bad idea and also their abuse of process....how can you criticize a bad idea if not by criticizing it??Skookum1 (talk) 08:14, 21 February 2014 (UTC)[reply]

Clara in Blunderland

Can you look at the discussion and help? Crisco and I cannot seem to understand one another. -- Evertype·✆ 09:47, 6 April 2014 (UTC)[reply]

I have announced removal of the deletion tag the day after tomorrow. LiliCharlie (talk) 14:07, 6 April 2014 (UTC)[reply]

ArbCom elections are now open!

Hi,
You appear to be eligible to vote in the current Arbitration Committee election. The Arbitration Committee is the panel of editors responsible for conducting the Wikipedia arbitration process. It has the authority to enact binding solutions for disputes between editors, primarily related to serious behavioural issues that the community has been unable to resolve. This includes the ability to impose site bans, topic bans, editing restrictions, and other measures needed to maintain our editing environment. The arbitration policy describes the Committee's roles and responsibilities in greater detail. If you wish to participate, you are welcome to review the candidates' statements and submit your choices on the voting page. For the Election committee, MediaWiki message delivery (talk) 14:30, 24 November 2015 (UTC)[reply]

Rendering Chinese characters

Hi there. I saw your comment at module talk ZH regarding the small size of Chinese fonts. If you add the following line to your vector.css file under preferences, you can make template wrapped Chinese text in articles appear how you wish by tweaking the colour/font/size:

span[lang|=zh] { color: purple; font-family: arial; font-size: 18px; }

Cheers, Philg88 ^♦talk 19:49, 25 February 2016 (UTC)[reply]
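For anyone unfamiliar with the |= attribute selector in the rule above: it matches an element whose lang value is exactly zh, or begins with zh followed by a hyphen (zh-Hant, zh-CN and so on). A minimal Python sketch of that matching rule follows. The function name lang_dash_match is a hypothetical helper, not part of any CSS library, and the sketch compares strings exactly, leaving out any case folding a browser may apply.

```python
def lang_dash_match(attr_value: str, prefix: str) -> bool:
    """Mimic the CSS [attr|=prefix] selector.

    True when the attribute value is exactly the prefix, or starts with
    the prefix immediately followed by a hyphen. (Hypothetical helper.)
    """
    return attr_value == prefix or attr_value.startswith(prefix + "-")


if __name__ == "__main__":
    # A few language tags, including the vi-hani tag used on this page.
    for tag in ("zh", "zh-Hant", "zh-CN", "zho", "ja", "vi-hani"):
        print(tag, lang_dash_match(tag, "zh"))
```

So text wrapped by a template that emits lang="zh-Hant" would be styled by the rule as well, while text tagged vi-hani would not.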

Oh, I already have a solution for that if proper language markup is used (e.g. {{zh}} or {{lang|zh}} etc.). It's a mix of browser preferences and Stylish which is also CSS based, but not restricted to en.WP. Despite a larger font size I use line-height: 100% for CJK characters, which is enough as I don't expect diacritics for CJK, except maybe for Zhuyin. It is extremely nice of you to care about an old person's eyesight and display problems. Thanks a lot. Love —LiliCharlie (talk) 20:09, 25 February 2016 (UTC)[reply]

You're welcome. Glad you have it sorted! Best, Philg88 ^♦talk 21:47, 25 February 2016 (UTC)[reply]

Nasal Sound

You reverted my changes from Nasal sound to Nasal Emission. But Nasal Sound is a disambiguation page. That is not a good choice for the link. Can you suggest a better way to handle this? Mb66w (talk) 02:10, 22 March 2016 (UTC)[reply]

@Mb66w: Yes I can. Simply link to Nasalization. This should yield acceptable results in all cases, though when it is clear that only consonants are intended you may also link to Nasal consonant, and for exclusively vowels, to Nasal vowel. Love —LiliCharlie (talk) 13:57, 22 March 2016 (UTC)[reply]

Thanks. I have made the changes. Mb66w (talk) 16:04, 22 March 2016 (UTC)[reply]

@Mb66w: And I have made changes to the Nasal sound disambiguation page ("a nasal sound is a sound produced with nasalization"). Thanks for your inspiration. Love —LiliCharlie (talk) 16:31, 22 March 2016 (UTC)[reply]

I made changes to the Nasal disambiguation page also. You may want to take a look at those and align better with what you did in Nasal sound Mb66w (talk) 16:36, 22 March 2016 (UTC)[reply]

@Mb66w: I have completely rewritten the linguistics section of Nasal. Thanks for prompting me. Love —LiliCharlie (talk) 17:45, 22 March 2016 (UTC)[reply]

Mistaken(?) warning

Just a heads-up that you placed a warning on my talkpage for "vandalism" to Chinese language, though the actual edit was by Prisencolin and that too was a mistaken blanking.

BTW, in future, it's generally inadvisable to jump to level 3 user warnings such as {{uw-vandalism3}} right off the bat. Those should only be used as a first warning in instances of clear vandalistic or bad-faith editing, which this was obviously not. Satellizer (´ ･ ω ･ `) 06:29, 26 March 2016 (UTC)[reply]

You're right on all points, sorry. Also my edit summary was a bit strange ("two third if the article"). It seems I was much too hasty. Love —LiliCharlie (talk) 07:41, 26 March 2016 (UTC)[reply]

GA page

There is no misconception about no accent. The Midwest has many. And many different kinds. I never got this accentless style of stuff and see no reason why it should be included, considering the page itself links to the numerous accents on that page. And I really don't even see what this has to do with General American at all. General American isn't the Midwest at all. Chrishayes00003 (talk) 16:01, 6 April 2016 (UTC)[reply]

There exists a widespread notion that a Midwest accent is close to or even identical with GA. (See for instance the Urban Dictionary: "General American is also known as the Midwestern Standard..." or dialect blog: "In the narrowest sense, the General American “heartland” is found in a tiny chunk of the midwest.") — Not everybody in the English speaking world knows as much about the American Midwest and its multitude of accents as you do, so it's helpful for lots of users to keep this sourced statement. Love —LiliCharlie (talk) 17:38, 6 April 2016 (UTC)[reply]

What does that have to do with the page though? And it was already stated the exactness is not known on the page. I don't get what this has to do with the midwest at all. Chrishayes00003 (talk) 18:14, 6 April 2016 (UTC)[reply]

A lot of people believe that GA and a Midwestern accent are closely related — see my citations above. This sentence debunks the myth. Love —LiliCharlie (talk) 18:25, 6 April 2016 (UTC)[reply]

And wikipedia is the final authority? Chrishayes00003 (talk) 18:51, 6 April 2016 (UTC)[reply]

Nobody is. Wikipedia is more like a collection of scientific views, conflicting or not. But they should all be reliably sourced. You are welcome to add other scientific views if you can cite such sources. Love —LiliCharlie (talk) 19:05, 6 April 2016 (UTC)[reply]

Yeah, but it just seemed to me that you wanted to keep that just to counter what some of those other websites said. Chrishayes00003 (talk) 19:12, 6 April 2016 (UTC)[reply]

What I wanted was to keep a reliably sourced statement alive. Wikipedia is work, and many Wikipedians work hard to find adequate sources. Love —LiliCharlie (talk) 19:28, 6 April 2016 (UTC)[reply]

And what I want to do is remove elements that have nothing to do with GA. It's needless spam. It's not the Midwest. And adding how other Midwesterners sound contributes nothing to the page. It's already stated within the article page that it has no traceable origins. That is all that needs to be said. Not any of this other useless stuff. Chrishayes00003 (talk) 17:18, 8 April 2016 (UTC)[reply]

I copied the above discussion to Talk:General American#GA and Midwest accents
where it can be continued with more participants.
Love —LiliCharlie (talk) 17:58, 8 April 2016 (UTC)[reply]

L2 speakers in Japanese

Shouldn't it be 11.5 million people instead of 11,500 million or 11.5 billion people speaking Japanese as a second language? I really doubt that there are 11.5 billion people speaking Japanese as a second language in a world of 7.5 billion people.--Christophe Hendrickx (talk) 19:24, 6 April 2016 (UTC)[reply]

No, it's only eleven thousand five hundred (or 0.0115 million) L2 speakers. That's why they don't change the total, which is expressed in millions. Love —LiliCharlie (talk) 19:32, 6 April 2016 (UTC)[reply]

I see, then it should be written as 0.0115 million, since the number of speakers is always expressed in millions for all other languages, and so that Japanese does not appear in first place when the L2 speakers column is sorted in descending order. Love,--Christophe Hendrickx (talk) 13:00, 7 April 2016 (UTC)[reply]

Making the table sortable is certainly an irrefutable argument. Thanks for your constructive contribution. Love —LiliCharlie (talk) 13:42, 7 April 2016 (UTC)[reply]
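The unit mix-up discussed in this thread can be spelled out as a small arithmetic check. This is only a sketch: the helper name to_millions is hypothetical, and the only figures used are the ones quoted above (11,500 L2 speakers and a world of 7.5 billion people).

```python
from fractions import Fraction


def to_millions(count: int) -> Fraction:
    """Express an absolute head count in millions (hypothetical helper)."""
    return Fraction(count, 1_000_000)


# Figure quoted in the thread: 11,500 L2 speakers of Japanese.
l2_speakers = 11_500
in_millions = to_millions(l2_speakers)

# What the cell would mean if "11,500" were misread as a value in millions.
misread_as_millions = l2_speakers * 1_000_000

if __name__ == "__main__":
    print(float(in_millions))      # value to show in a column given in millions
    print(misread_as_millions)     # the implausible reading, in people
    print(misread_as_millions > 7_500_000_000)
```

Keeping every cell of the column in the same unit is what lets a numeric sort put the languages in a sensible order.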

You're welcome, thanks for your kind words!--Christophe Hendrickx (talk) 17:19, 11 April 2016 (UTC)[reply]

Wilson

http://www.dictionary.com/browse/wilson

Hello! I heard [ˈwɪɫsɐ̃n] at this dictionary. Is it wrong? Fête Phung (talk) 15:21, 10 April 2016 (UTC)[reply]

Talk back

Hello, LiliCharlie. You have new messages at Talk:World language.
You can remove this notice at any time by removing the {{Talkback}} or {{Tb}} template.

Hello @LiliCharlie:, I left a reply at the talk page, thank you. (58.164.113.8 (talk) 00:40, 1 May 2016 (UTC))[reply]

Ignoring messages

Why are you blatantly ignoring my responses on the talk page? If you don't want to be involved in the discussion then just say so. I can see that you're replying to everyone else's messages on other talk pages but you're just ignoring mine. Don't you think I deserve an explanation for your sudden disappearance? I can see you're still active on Wikipedia. (137.147.52.163 (talk) 13:49, 6 May 2016 (UTC))[reply]

I'm not sure whether you really didn't see this message, which is hard to believe, but I'm going to "ping" you so you see it, @LiliCharlie:. If you don't want to be involved in the discussion you can say you're not interested so I don't need to keep on checking back to see if you've replied or not. (121.214.104.174 (talk) 04:04, 7 May 2016 (UTC))[reply]

I refuse to get involved in discussions with new IPs who pretend they are the same person as someone else. Love —LiliCharlie (talk) 04:41, 7 May 2016 (UTC)[reply]

Thanks for replying. Well, I am not a different person @LiliCharlie:. If you look at the geolocation of my IP address, it locates to Victoria, Australia each time. I am the same user; it's just that my IP address changes every few days, and there are users on Wikipedia to whom I have mentioned this so that they know they're talking to the same person on talk pages. I should have told you that my IP address always changes, but I thought the problem might be resolved in a few days and not a week, so I didn't mention it. If you want to check my geolocation there is a link at the bottom of each user page. (121.214.104.174 (talk) 06:07, 7 May 2016 (UTC))[reply]

Hello @LiliCharlie:, could you please stop ignoring my messages? I told you that I am the same person. It's absurd to believe that a different IP user would come directly to the World language talk page and leave messages when registered Wikipedia users don't even frequent that page often. I am the same user and I have told you that I geolocate to Victoria, Australia, so could you please stop ignoring me? You are being extremely rude. (120.144.46.18 (talk) 00:38, 8 May 2016 (UTC))[reply]

Ignoring messages again

Hello @LiliCharlie:, just so you know I thought the way you treated me was quite unfair and rude. I gave you an explanation but you did not give me a reply. I hope you don't treat other users the same way you did to me in the future. (121.214.149.4 (talk) 09:16, 9 May 2016 (UTC))[reply]

ß

Based on Unicode allocation and its size, ß is an uppercase letter, much like Ð and Þ. It is common to see the letter ß in all caps, especially on sport jerseys (See KIEßLING). In fact, it does not look strange if the word 'große' is capitalized to 'GROßE.'

Can we remove the fact that ß is "lowercase only" ?? — Preceding unsigned comment added by 2607:FEA8:3CA0:4CC:DDA8:B5B7:8287:C57F (talk) 20:34, 2 August 2016 (UTC)[reply]

No, ß is lowercase and its uppercase counterpart is ẞ. 'große' should actually be capitalised as 'GROSSE' and less commonly (unofficially and strongly criticised) as 'GROẞE', not 'GROßE'. Love —LiliCharlie (talk) 20:51, 2 August 2016 (UTC)[reply]

P.S.: Here is a scan from an English document of 1586 in which the words witness, assuring, thankfulness, goodness and blessings are written as witneße, aßuring, thankfulneße, goodneße and bleßings. (The scan is from this blog.) As you see ß is merely a ligature of ss. Love —LiliCharlie (talk) 21:08, 2 August 2016 (UTC)[reply]

P.P.S.: Also have a look at Arthur Guinness's signature in this photo. (It's something like Arth Guinneſs — or Arth Guinneß by German or older English standards.) Love —LiliCharlie (talk) 21:23, 2 August 2016 (UTC)[reply]
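The case relationships described in the replies above can be checked against Unicode's default case mapping as Python implements it. This is a minimal sketch: the helper describe is hypothetical, the characters are written as escapes to avoid font confusion, and the result says nothing about what any orthography rule prescribes.

```python
import unicodedata

SHARP_S_SMALL = "\u00df"    # ß  LATIN SMALL LETTER SHARP S
SHARP_S_CAPITAL = "\u1e9e"  # ẞ  LATIN CAPITAL LETTER SHARP S
LONG_S = "\u017f"           # ſ  LATIN SMALL LETTER LONG S


def describe(ch: str) -> tuple[str, str, str]:
    """Return (general category, uppercase form, lowercase form) of ch."""
    return unicodedata.category(ch), ch.upper(), ch.lower()


if __name__ == "__main__":
    for ch in (SHARP_S_SMALL, SHARP_S_CAPITAL, LONG_S):
        print(unicodedata.name(ch), describe(ch))
    # Default uppercasing of the word discussed in this thread.
    print(("gro" + SHARP_S_SMALL + "e").upper())
    # The long s in the signature example uppercases to a plain S.
    print(("Guinne" + LONG_S + "s").upper())
```

Under these default mappings ß is classed as a lowercase letter (Ll) whose uppercase form is SS, while ẞ is classed as uppercase (Lu) and lowercases to ß.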

ArbCom Elections 2016: Voting now open!

Hello, LiliCharlie. Voting in the 2016 Arbitration Committee elections is open from Monday, 00:00, 21 November through Sunday, 23:59, 4 December to all unblocked users who have registered an account before Wednesday, 00:00, 28 October 2016 and have made at least 150 mainspace edits before Sunday, 00:00, 1 November 2016.

The Arbitration Committee is the panel of editors responsible for conducting the Wikipedia arbitration process. It has the authority to impose binding solutions to disputes between editors, primarily for serious conduct disputes the community has been unable to resolve. This includes the authority to impose site bans, topic bans, editing restrictions, and other measures needed to maintain our editing environment. The arbitration policy describes the Committee's roles and responsibilities in greater detail.

If you wish to participate in the 2016 election, please review the candidates' statements and submit your choices on the voting page. MediaWiki message delivery (talk) 22:08, 21 November 2016 (UTC)[reply]

?

(ns[r.107]+ns[r.108])/2=5

What is "r."? Please reply on my talk page.68.150.86.232 (talk) 11:55, 15 December 2016 (UTC)[reply]

Mandarin as official in Hong Kong and Macau

Hello, I've looked at the links that you provided from both the Hong Kong and Macau governments regarding official language policy in the two territories. However, their constitutions only provide Chinese (ambiguously mentioned) as an official language alongside their historic colonial languages. In practice, Cantonese is the de facto working language of the government and Mandarin is hardly ever used, except when officials from the Beijing government are involved, in which case simultaneous interpretation in Mandarin is provided. Having lived in Hong Kong and visited Macau multiple times, I can say that Mandarin is not as omnipresent as you'd expect or want it to be, much to the CCP's dismay.

When official emergency announcements (not routine) are provided on transit, media, etc. in Chinese, Cantonese is the only variant used. Under the one country, two systems policy, Chinese equates to Cantonese in these territories (although recent broken promises by China prove to be a challenge). Therefore, Mandarin has a position similar to what Spanish has in the U.S. or French in the Canadian province of Ontario, a minority language that's provided due to the large number of users present in the territories but not official, although in this case, visitors rather than residents. -- User:Moalli (talk) -- 08:19, 9 January 2017 (UTC)[reply]

Edit: I'd like to add that government media in Hong Kong (RTHK) and Macau (TDM) have their free-to-air Chinese networks in Cantonese and their primary Chinese education curriculum in Cantonese. A link from the HSK, the official Mandarin standardized test sponsored by the Chinese government, also identifies Cantonese as the spoken form of Chinese in these territories [3]. Given this ambiguity in the Basic Laws, it'd make just as much sense to add Shanghainese or Hokkien as official languages since they fall under the "Chinese language" umbrella. -- User:Moalli (talk) -- 09:00, 9 January 2017 (UTC)[reply]

This is indeed a strange definition of official language, which has nothing to do with a majority of people who speak a certain vernacular, nor with schooling or the media, but a lot with certificates, forms and other documents issued by and used in governmental offices — hence the word official. Even the laws in HK and MO are laid down in Standard Chinese (plus English and Portuguese respectively).

An official language is not at all the same as a national language or a majority language, witness the situation in Namibia and many other countries. Love —LiliCharlie (talk) 10:24, 9 January 2017 (UTC)[reply]

Oh, okay I totally understand where you're going now. Yes, official documents in the two regions are laid down in standard Chinese, but it refers to standard written Chinese rather than a specific spoken vernacular. Makes this ambiguity even more confusing. While Cantonese itself does have a colloquial written form, it has not been completely standardized and hence, the written form is standard vernacular written Chinese, which coincidentally matches with Putonghua perfectly. Cantonese speakers still read in their own variant with these characters, although it will sound very formal. In that case, might a suggestion be to still remove the two countries from the spoken Standard Chinese infobox but mention their use in inter-governmental affairs with Beijing using the sources that you have provided? -- Moalli (talk) 05:18, 10 January 2017 (UTC)[reply]

If the article Standard Chinese was only dealing with standard spoken Chinese I might agree. But it obviously doesn't. Love —LiliCharlie (talk) 18:52, 10 January 2017 (UTC)[reply]

Actually, it seems that it does. Again, there is a difference between spoken 'standard' Chinese as defined by the PRC (Putonghua) and standard written Chinese (not spoken) that is used by speakers of any Chinese variant and on official HK/Macau documents. There was even a debate on the talk page about defining what standard Chinese vs Mandarin is when regarding this article. Initially, these two territories were excluded from the infobox for a reason until someone with 'Marxist' in his username decided to tag them on it. In order to provide a factual and neutral POV, it would be best to only mention the territories in the body with the sources, since the language isn't de jure or even de facto official, as their Basic Laws leave it ambiguous. -- Moalli (talk) 02:00, 11 January 2017 (UTC)[reply]

The article Standard Chinese clearly covers the written language. It does so in the introduction and even has a dedicated section Writing system. Love —LiliCharlie (talk) 18:22, 11 January 2017 (UTC)[reply]

The main article links to written Chinese, which definitely is not the same as spoken language. From article: The writing system for almost all the varieties of Chinese is based on a set of written logograms that has been passed down with little change for more than two thousand years. Each of these varieties of Chinese has developed some new words during this time, words for which there are no matching characters in the original set. It does not mention the role of Putonghua at all. Any variant of Chinese is written using this system, which just happens to match how standard Chinese is spoken. However, thank you for putting this to debate on the article page. -- Moalli (talk) 02:14, 12 January 2017 (UTC)[reply]

Again this is not true. Not all varieties of Chinese are written using Chinese characters. — If you are not an expert then why don't you take a few minutes to check what you claim to be facts? Love —LiliCharlie (talk) 18:10, 12 January 2017 (UTC)[reply]

P.S.: The Chinese equivalent of Standard Chinese is not 普通话 or Putonghua, but (现代)标准汉语. Please remember that Mandarin is a pluricentric language — like English, BTW. Love —LiliCharlie (talk) 20:26, 12 January 2017 (UTC)[reply]

List of territorial entities where German is an official language

I didn't come across D-A-CH either until XING (and others), but it makes some sense to me (not completely, as plates for Austria are in fact not the expected German Ö). ※ Sobreira ◣◥ (parlez) 22:33, 13 January 2017 (UTC)[reply]

I don't think this is the right place to introduce the abbreviation, cf. Wikipedia:What Wikipedia is not#Wikipedia is not an indiscriminate collection of information. It is not correct to randomly insert information from sources that have nothing to do with German as an official language; D-A-CH simply doesn't seem to be a technical term in this field of knowledge. (BTW the term Dachsprache is important for the article Standard German which covers standardized varieties of German from Germany, Austria and Switzerland, but this is mere coincidence since the word is derived from German Dach "roof" + Sprache "language.") — If you would like to provide information about number plates you can edit one of the articles Vehicle registration plate, Vehicle registration plates of Europe or List of international vehicle registration codes. (International vehicle registration codes always consist of up to three basic Latin letters A–Z without diacritics, so Ö would not be "expected," but illegal.) Love —LiliCharlie (talk) 08:39, 14 January 2017 (UTC)[reply]
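The format rule mentioned at the end of the reply above (up to three basic Latin letters A–Z, no diacritics) can be written as a simple pattern check. This is only a sketch of the shape of a code as described there: the function name looks_like_registration_code is hypothetical, and the check does not tell whether a given code is actually assigned to a country.

```python
import re

# One to three basic Latin capital letters, nothing else.
CODE_PATTERN = re.compile(r"[A-Z]{1,3}")


def looks_like_registration_code(code: str) -> bool:
    """Check only the shape of a code, not whether it is assigned."""
    return CODE_PATTERN.fullmatch(code) is not None


if __name__ == "__main__":
    # D, A and CH are the codes behind the abbreviation D-A-CH.
    for candidate in ("D", "A", "CH", "\u00d6", "ABCD"):
        print(repr(candidate), looks_like_registration_code(candidate))
```

A code such as Ö fails the check because the character class [A-Z] covers only the 26 basic Latin capitals.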

Writing system

Thanks, but the major issue is that the Spanish article material copied is sourced to a book published by SAIS, the Scientific Atlantology International Society. Doug Weller talk 10:04, 14 January 2017 (UTC)[reply]

Right. Plus "Paleolithic linear writing (ELPA)" is far too old even for proto-writing which is disputable by nature. Love —LiliCharlie (talk) 10:42, 14 January 2017 (UTC)[reply]

(not so) recent changes at standard chinese phonology and pinyin

Hello. It seems that User:向日葵的夏天 has made certain changes that brought the transcription at Standard Chinese phonology and pinyin back to the complicated old transcription and away from Duanmu, Lin and Lee & Zee. Do you think we should keep his changes? I've avoided reverting further to prevent an edit war.--Officer781 (talk) 00:50, 5 February 2017 (UTC)[reply]

Hi Officer781, what I want is 1. a uniform transcription in all en.WP articles that 2. conforms to IPA usage.

Here is a citation from the section The Principles of the International Phonetic Association on page 159 of the Handbook of the International Phonetic Association:

"4 The construction and use of the IPA are guided by the following principles:

(a) When two sounds occurring in a given language are employed for distinguishing one word from another, they should wherever possible be represented by two distinct symbols without diacritics. Ordinary roman letters should be used as far as practicable, but recourse must be had to other symbols when the roman alphabet is inadequate."

It seems the transcription system 向日葵的夏天 has introduced constantly violates IPA Principle 4 (a) since neither the diacritics nor some special non-roman letters are necessary. Love —LiliCharlie (talk) 13:51, 5 February 2017 (UTC)[reply]

I'm copying this conversation over to standard chinese phonology and getting the editor involved to discuss.--Officer781 (talk) 15:34, 5 February 2017 (UTC)[reply]

Revert

Hej LiliCharlie! Concerning this revert and the message you left on anonymous' talk page: when I look at the image, the word reads utilißimae. Not exactly what anonymous had written, but also not what you reverted to. Richard 09:49, 23 February 2017 (UTC)[reply]

PS: I had the file renamed on Commons. Richard 10:15, 23 February 2017 (UTC)[reply]

Sorry, I just read the file name without actually scrolling down to the image so I thought the intended word was ultißimae “the very last ones (feminine).” Thank you for renaming the file. Love —LiliCharlie (talk) 12:55, 23 February 2017 (UTC)[reply]

I figured as much. Perhaps it would be an idea to review your remark on anonymous' talk page? Richard 13:24, 23 February 2017 (UTC)[reply]

Done. I have deleted that edit. Love —LiliCharlie (talk) 13:38, 23 February 2017 (UTC)[reply]

Thank you ;) Richard 15:14, 23 February 2017 (UTC)[reply]

German alphabet

Hello! I can see that you are among the most recent contributors to our German orthography article. I find it wrong to state that letters such as Ä, Ö and Ü are considered "special" from the perspective of that article. I have not removed anything, only added. I think it's polite to alert you, since I can see that the last changes are a year old. I'm not looking for any edit warring, but I am looking for the truth, including sources. Cheers! Boeing720 (talk) 00:58, 13 March 2017 (UTC)[reply]

Can you give more evidence about the deletion of my edit in "CJK Unified Ideographs"?

You said that U+2011E 𠄞 is a variant version of 上, and is not equivalent to U+4E8C 二. Can you give any link or extra examples to explain that? -- Albert Micah Hang (Talk) 11:06 Beijing Time (UTC+8), May 29th, 2017 —Preceding undated comment added 03:05, 29 May 2017 (UTC)[reply]

Hi Albert Micah Hang, you can look up the character at http://www.zdic.net/z/84/js/2011E.htm or at http://www.guoxuedashi.com/zidian/z28073t.html for instance. In the famous 說文解字 dictionary (which was written in seal script) both are radicals: 𠄞 shàng is radical 2 while 二 èr is radical 479. For an overview of the Shuowen radicals see also The 540 Shuowen Seal Radicals - numbered.svg. Love —LiliCharlie (talk) 17:23, 29 May 2017 (UTC)[reply]

Hi LiliCharlie, I have looked at your comment and seen the links you gave. In these links, they are the same. HOWEVER, I have searched a couple of other websites that write radical 2 as 丄, like [4] and [5], and their shapes are NOT the same as yours. Albert Micah Hang (talk) 00:08, 30 May 2017 (UTC)[reply]

Hi Albert Micah Hang, and thank you for your interesting links. My software version of the Shuowen dictionary which aims at reproducing the original text by 許慎 has 𠄞 shàng as radical 2, and these are the 10 characters and variant characters grouped together under this radical:

《說文解字‧注》 by 段玉裁 is cited as the source for this enumeration of characters which correspond to the more "modern" Hàn characters 𠄞、丄(上)、帝、𢂇、㫄(旁)、𣃟、𣃙、雱、𠄟、丅(下).

In ancient times the two strokes of the character 二 èr were of equal length, but later the upper stroke became shorter than the lower one leading to confusion with 𠄞 shàng. Maybe the reason that in later Shuowen editions 丄 shàng (the second character of my illustration) was sometimes or even usually taken to represent radical 2 was to avoid this confusion.

Anyway there is no chance that at the time the CJK Unified Ideographs Extension B block was added to Unicode in March 2001 someone thought the character 二 èr was still missing from Unicode. It had already been a Unicode character since June 1992, of course. And it is also not conceivable that all the reviewers of the CJK Unified Ideographs Extension B proposal confused this extremely simple two-stroke character with any other character. It is clear that 𠄞 (U+2011E) was intended for something else, namely shàng, and this is the way Chinese researchers and dictionary makers use this Unicode character. Love —LiliCharlie (talk) 17:01, 30 May 2017 (UTC)[reply]
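For readers who want to check the two code points discussed in this thread, here is a minimal Python sketch. The block ranges are those of the Unicode code charts; the names `BLOCKS` and `block_of` are made up for this illustration and are not part of any library.

```python
import unicodedata

# Block ranges as listed in the Unicode code charts.
# BLOCKS and block_of are illustrative names, not a library API.
BLOCKS = [
    (0x4E00, 0x9FFF, "CJK Unified Ideographs"),
    (0x20000, 0x2A6DF, "CJK Unified Ideographs Extension B"),
]


def block_of(ch: str) -> str:
    """Return the name of the block containing ch, or 'other'."""
    cp = ord(ch)
    for start, end, name in BLOCKS:
        if start <= cp <= end:
            return name
    return "other"


shang = "\U0002011E"  # 𠄞, the character discussed as shàng
er = "\u4E8C"         # 二 èr

for ch in (shang, er):
    # unicodedata.name falls back to the second argument if the
    # Python build's Unicode database has no name for the character.
    print(f"U+{ord(ch):04X}", block_of(ch), unicodedata.name(ch, "<no name>"))
```

The sketch only shows that the two characters are distinct code points in different blocks; it says nothing about their meaning, which is what the dictionaries cited above are for.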

Thanks for the deletion about Putonghua, but you had better write to me first, so that I can redress my own error.

Thanks a lot. You found the semantic typo about Putonghua so quickly! I fixed it. I hope this one is all right. --PoetVeches (talk) 15:46, 18 September 2017 (UTC)[reply]

ArbCom 2017 election voter message

Hello, LiliCharlie. Voting in the 2017 Arbitration Committee elections is now open until 23.59 on Sunday, 10 December. All users who registered an account before Saturday, 28 October 2017, made at least 150 mainspace edits before Wednesday, 1 November 2017 and are not currently blocked are eligible to vote. Users with alternate accounts may only vote once.

The Arbitration Committee is the panel of editors responsible for conducting the Wikipedia arbitration process. It has the authority to impose binding solutions to disputes between editors, primarily for serious conduct disputes the community has been unable to resolve. This includes the authority to impose site bans, topic bans, editing restrictions, and other measures needed to maintain our editing environment. The arbitration policy describes the Committee's roles and responsibilities in greater detail.

If you wish to participate in the 2017 election, please review the candidates and submit your choices on the voting page. MediaWiki message delivery (talk) 18:42, 3 December 2017 (UTC)[reply]

Supporting Zhuang Characters

It is important that Zhuang characters, or Sawndip, be supported. Zhuang characters, like chữ nôm, have used CJKV ideographs to write a non-Chinese language for hundreds of years. In Unicode 10 over one thousand Zhuang characters were added. Wikipedia uses Unicode for text when the characters are in Unicode. Images of characters in text should be restricted to characters not currently in Unicode. Happy to work with you or others on how to solve problems with respect to Zhuang Characters on wikipedia pages. Johnkn63 (talk) 05:03, 30 January 2018 (UTC)[reply]

@Johnkn63: Well, I have five fonts installed on my system that support the character 𭨡 (U+2DA21). They are BabelStone Han, HanaMinB (=花園明朝B), TH-Khaai-PP2, TH-Khaai-TP2, and TH-Tshyn-P2. However very few of our users will have even one of them installed on their respective systems. As far as I can tell no major OS ships fonts that cover CJK Unified Ideographs Extension F yet. Besides, Unicode 10 isn't even close to supporting the full range of Sawndip characters. For example, several characters of File:Universal Declaration of Human Rights Zhuang Sawndip.png and of File:Universal Declaration of Human Rights Zhuang Sawndip Traditional variant.jpg are still not supported, which is the very reason Sawndip doesn't show up on our Help:Multilingual support (East Asian) page. So at this point it's a futile endeavor to "solve problems with respect to Zhuang Characters on wikipedia pages." And since we can't recommend users a general viable solution to overcome problems with Sawndip characters I am convinced that we should at least provide them with graphics as substitutes instead of leaving them with tofu. Love —LiliCharlie (talk) 14:23, 30 January 2018 (UTC)[reply]

Thank you for pointing out the need to update the Help:Multilingual support (East Asian) page. I have now updated this. File:Universal Declaration of Human Rights Zhuang Sawndip.png was produced about a decade before Unicode 10 and as such does not show the progress that has been made. File:Universal Declaration of Human Rights Zhuang Sawndip Traditional variant.jpg was made by transposing the characters in the former file; it contains characters I have not seen in any Zhuang document. A character not displaying tells a person they need to install another font, which is not a bad thing. Many more Zhuang characters should still be added to Unicode (around a further 1,700 have been submitted to IRG already). The number of chữ nôm characters submitted to IRG last year was twice that of Zhuang characters, but we do not say that encoded chữ nôm should be replaced with images. Similarly, we should support encoded Zhuang characters. Johnkn63 (talk) 05:28, 1 February 2018 (UTC)[reply]

Striking

Just FYI, the <strike> element hasn't existed in HTML for a long time. It's <s> now. Using the long form triggers our HTML "lint" cleanup filters, and such. I.e., it's stuff we'll need to clean up later as we migrate to HTML5. Happily, the valid version is shorter and more convenient. The <del> element also works, for semantic difference from <ins>. — SMcCandlish ☏ ¢ 😼 01:18, 7 July 2018 (UTC)[reply]
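The replacement described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `modernize_strike` is a hypothetical helper name, not an existing MediaWiki or bot function, and a plain regular expression like this does not know about nowiki sections or code samples, so any real cleanup would need more care.

```python
import re

# Matches an opening or closing <strike> tag and keeps any attributes.
# The \b prevents matches on longer tag names that merely start with "strike".
_STRIKE_TAG = re.compile(r"<(/?)strike\b([^>]*)>", re.IGNORECASE)


def modernize_strike(wikitext: str) -> str:
    """Replace obsolete <strike>...</strike> tags with <s>...</s>.

    Hypothetical helper for illustration only.
    """
    return _STRIKE_TAG.sub(r"<\1s\2>", wikitext)


print(modernize_strike("I <strike>disagree</strike> agree."))
```

Using <del> instead would be the choice when the struck text is meant as a tracked deletion paired with <ins>, as the comment above notes.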

Mandarin

Hi! I understand Mandarin is a branch of Chinese. And the branches of Mandarin (Putonghua, Taiwan Guoyu, Singaporean/Malaysian Huayu) are implied by using the single term without annotation, just as Spanish is not noted as including Chilean, Mexican, and Iberian branches. LUMINR (talk) 11:18, 15 November 2018 (UTC)[reply]

@LUMINR: The term Mandarin may refer to a branch of Chinese that covers many varieties not all of which are mutually intelligible, see Mandarin Chinese. Mandarin may also refer to the “standard literary and official form of Chinese”, for which see Standard Chinese. The term requires disambiguation, as on our page Mandarin. Love —LiliCharlie (talk) 11:33, 15 November 2018 (UTC)[reply]

Thank you for clarifying :) Is there a common term for the mutually intelligible varieties, i.e. what people generally think of when they hear "Mandarin"? LUMINR (talk) 12:12, 15 November 2018 (UTC)[reply]

No, that wouldn't make sense: Variety A may be mutually intelligible with both B and C, but B not with C, etc. See article Dialect continuum. Love —LiliCharlie (talk) 12:37, 15 November 2018 (UTC)[reply]

ArbCom 2018 election voter message

Hello, LiliCharlie. Voting in the 2018 Arbitration Committee elections is now open until 23.59 on Sunday, 3 December. All users who registered an account before Sunday, 28 October 2018, made at least 150 mainspace edits before Thursday, 1 November 2018 and are not currently blocked are eligible to vote. Users with alternate accounts may only vote once.

The Arbitration Committee is the panel of editors responsible for conducting the Wikipedia arbitration process. It has the authority to impose binding solutions to disputes between editors, primarily for serious conduct disputes the community has been unable to resolve. This includes the authority to impose site bans, topic bans, editing restrictions, and other measures needed to maintain our editing environment. The arbitration policy describes the Committee's roles and responsibilities in greater detail.

If you wish to participate in the 2018 election, please review the candidates and submit your choices on the voting page. MediaWiki message delivery (talk) 18:42, 19 November 2018 (UTC)[reply]

Warning

https://en.wikipedia.org/wiki/File:Nasal-ingressive_voiceless_velo-pharyngeal_trill.svg

This might be a copyright violation; I wouldn't upload it to Wikipedia if I were you. IPA editor (talk) 20:58, 7 January 2019 (UTC)[reply]

@IPA editor: This is not copyright violation, see commons:File talk:Nasal-ingressive voiceless velo-pharyngeal trill.svg. If you still think it is you are invited to join that discussion. Love —LiliCharlie (talk) 05:48, 8 January 2019 (UTC)[reply]

@LiliCharlie: Oh, ok. Revert my edits if you like, feel free. IPA editor (talk) 13:02, 8 January 2019 (UTC)[reply]

Thank you for reverting unlogged IPA editor's vandalism in Help:IPA/Italian! Would you like to join a discussion I've opened in the Linguistics talk? Iuscaogdan (talk) 19:02, 12 January 2019 (UTC)[reply]

On abbreviation in German

On the -bğ combination: unfortunately, I cannot upload the files with these elements, but it is possible to find some freely accessible maps. I've seen at least 3 maps from different periods (19th century, mid-50s and mid-90s) with such an abbreviation for -burg. That is, this is a common practice. Is it enough for the article? 83.149.240.100 (talk) 15:04, 8 February 2019 (UTC)[reply]

No, that's original research which Wikipedia articles must not contain. We don't create news and report things that experts on the subject don't confirm. — The subject here is the institutionally regulated German orthography, and the breve is not part of the inventory of symbols that the legally competent institution recognises as belonging to German orthography (Greek orthós "correct" & gráphein "to write"). Love —LiliCharlie (talk) 15:31, 8 February 2019 (UTC)[reply]

Here is an example with an interletter breve: Saarb˘g, Gusenb˘g, Freudenb˘g. I found this map today on Wikimedia. It's only one of them... I suspect that this abbreviation derives from the tradition of writing u with a breve-like symbol. ^[1] 83.149.240.100 (talk) 15:34, 8 February 2019 (UTC)[reply]

There are some things that (maybe) weren't mentioned in Duden and so on. I'll try to find them. But if the thing has existed over a long period, can we say that it's original research? If it is not a part of orthography, where should we place this piece of information on the writing system? 83.149.240.100 (talk) 15:38, 8 February 2019 (UTC)[reply]

I think, we can place it in the article on the breve sign. And mention the traditional handwritten form of u (e.g. in Kurrent (Kurrentschrift)) as a possible source of appearance. 83.149.240.100 (talk) 15:49, 8 February 2019 (UTC)[reply]

A cartographic representation of words is not the same thing as their orthographic representation.
As an occasional user of Sütterlin I see no relation between Kurrentschrift's graphic representation of u and the breve. In Kurrentschrift the ˘-like shape is part of the basic letter u itself, not a diacritic that modifies a letter. This letter shape doesn't alter the sequence of orthographic symbols.
Please refrain from engaging in original research which has no place in this encyclopaedia. We report what reliable third-party published sources have reported rather than the beliefs or experiences of our editors, see Wikipedia:Verifiability. Love —LiliCharlie (talk) 16:06, 8 February 2019 (UTC)[reply]

As an occasional user of Kurrent I see some relation, because breve is used to make the contrast between u and n visible: it's a significant part of letter used to distinguish two different graphemes. 83.149.240.100 (talk) 18:00, 8 February 2019 (UTC)[reply]

References

^ https://upload.wikimedia.org/wikipedia/commons/5/58/Karte_des_Deutschen_Reichs_%28Vogel%29_-_Section_22_-_Strassburg.png

List of languages by number of native speakers

Hello LiliCharlie, Thank you for editing the page List of languages by number of native speakers. I looked at the numbers and couldn't find a solid source. Can you somehow arrange for all of them to come from one source? If you haven't added a source, please do so as soon as possible.

Thank you Qian Nivan (talk) 17:33, 22 February 2019 (UTC)[reply]

I'd like to...

I'd like to work together with you on phonology to make the introduction reflect content of the second paragraph. At the moment the introduction contradicts the second paragraph. — Preceding unsigned comment added by Kelly222 (talk • contribs) 05:50, 12 March 2019 (UTC)[reply]

Hindi (incl. Urdu as a variety in India)

@LiliCharlie: Please see the Dialects section in this link: https://www.ethnologue.com/language/hin

Where it says, "Khari Boli (Dehlavi, Kauravi, Khadi Boli, Khari, Khariboli, Vernacular Hindustani). Formal vocabulary borrowed from Sanskrit, de-Persianized, de-Arabicized. Literary Hindi, or Hindi-Urdu, has 4 varieties: Hindi (High Hindi, Nagari Hindi, Literary Hindi, standard Hindi); Urdu [urd]; Dakhini; Rekhta. Hindustani, though not listed separately in India, refers here to the unofficial lingua franca of northwest India. Has a lexical mixture in varying proportions of Hindi (vocabulary derived from Sanskrit) and Urdu (vocabulary derived from Persian or Arabic)."

So clearly Urdu is listed as a 4th variety of literary Hindi. 2607:9880:4038:B:51D3:E0A3:7839:252C (talk) 14:08, 24 March 2019 (UTC)[reply]

I'll reply on the article's talk page at Talk:List of languages by total number of speakers#Hindi (incl. Urdu as a variety in India). Love —LiliCharlie (talk) 16:35, 24 March 2019 (UTC)[reply]

Spanish (and French) language maps

LiliCharlie - First, I would like to thank you for uploading my map into WikiCommons and making it easily accessible from the "World Languages" article.

'NothingHam Bread' has retyped his requests in my personal Talk-File, and I have added more comments: see https://en.wikipedia.org/wiki/User_talk:DLMcN#World_language_maps.

With regards, --DLMcN (talk) 04:40, 5 April 2019 (UTC)[reply]

Traditional Chinese characters

Hey- I saw you deleted the sentence about the "semi-simplified" form of the word 'Taiwan'. There are some assertions made in that same paragraph that are given no citation, so I would invite you (as a person that deletes uncited material) to take a second look at that paragraph and delete some more if you are interested. If it were me, I would probably just keep the whole paragraph, which I generally agree with, but that's because I don't really know how to pick apart what really needs citation and what kind of doesn't. Geographyinitiative (talk) 08:38, 13 May 2019 (UTC)[reply]

Cantonese

I raised the issue at ANI a few weeks ago, which helped for a while, but here we are again, so I've re-raised it at Wikipedia:Administrators' noticeboard/Incidents#Cantonese again. By the way, users are only pinged if they are mentioned in your initial signed comment. Adding them in a later edit doesn't generate a ping. Kanguole 22:19, 30 June 2019 (UTC)[reply]

A small request

Hi. Could you replace the vowel chart on User_talk:JackintheBox#NZ_English_dress_vowel with [6]? I've replaced it on New Zealand English phonology and on Asturian and Spanish Wikipedias and I'm about to make a delete request on Commons. The chart is not only unreliable (they don't cite sources, /e/ is too low on it, it shows /ɵː/ and /ʉː/ as too close to each other and the corresponding diphthong charts based on the same source show /æʊ/ and /ɐʉ/ as [æɔ] and [ɑɵ], which isn't supported by the literature), but it's actually been deleted from the site sometime (I guess a month or two) ago, along with the diphthong chart.

The new chart actually illustrates your point more strongly, as far as I can see. Kbb2 (ex. Mr KEBAB) (talk) 15:17, 16 July 2019 (UTC)[reply]

Done. Love —LiliCharlie (talk) 15:55, 16 July 2019 (UTC)[reply]

ANI notice

There is currently a discussion at Wikipedia:Administrators' noticeboard/Incidents regarding an issue with which you may have been involved. The thread is Continued disruption at Cantonese, again, again. --Blackmane (talk) 14:13, 2 August 2019 (UTC)[reply]

Discussion Invitation

Hi there, LiliCharlie! I don't think we've ever crossed paths. I'm KyleJoan. Pleased to make your acquaintance! I'm writing because I saw that you contributed to this thread discussing the sorting of a list involving an American person with a surname of French origin. This very discussion is still active over at Talk:The Real Housewives of New York City, and I would love for you to chime in if you would be so kind. Thank you very much, and have a wonderful rest of your week! KyleJoan^talk 05:46, 7 August 2019 (UTC)[reply]

Thanks

@LiliCharlie: By the way, Manipuri is not a local name. Manipuri describes people living in the state of Manipur, India. Not all the people in Manipur speak this language (Meitei language), and there are many languages in Manipur. The name Manipuri language is sometimes confused with the Bishnupriya Manipuri language. And many people in the state of Manipur don't want the Meitei language to be called Manipuri, since Manipuri is an English term derived from the Sanskrit-origin name Manipur, "Jewel land". Awangba Mangang (talk) 01:16, 12 August 2019 (UTC)[reply]

German spoken by Italians.

Yes I mean the South Tyroleans. They are Italians, too. And it is their first and official language. Why did you revert it? — Preceding unsigned comment added by Paradise Chronicle (talk • contribs) 20:10, 31 August 2019 (UTC)[reply]

@Paradise Chronicle: In the infobox of article German language this appears under the "Ethnicity" header, so Italians would mean ethnic Italians, not people having Italian citizenship or nationality. Also note that Italy already appears three times in the infobox: implicitly as "Region: Worldwide", and explicitly as "Official language in: 3 dependencies: South Tyrol (Italy)" as well as "Recognised minority language in: 11 countries: Italy".

German-speaking South Tiroleans are mentioned in the last sentence of the introduction to article Austrians as a "closely related" but separate ethnic group; the article's infobox mentions only 16,331 Austrians in Italy. This means that according to en.WP standards South Tiroleans should probably be added to the list of ethnicities in the German language infobox. You are invited to do this job. — Do you know someone who is able to start a separate article South Tyroleans (which is currently a redirect to South Tyrol) about the German-speaking ethnic group (as opposed to the Italian- and Ladin-speaking ethnicities of the autonomous province)? Love —LiliCharlie (talk) 23:10, 31 August 2019 (UTC)[reply]

Yes, I have seen this. I have just been to Switzerland and noticed that they don't speak German there in their free time, and German tourists in Switzerland usually don't understand the Swiss people speaking Swiss German either. But in South Tyrol they do speak German in their free time and the Germans do understand them. These were my thoughts on why I included Italy. But I have also understood that it is not an official language of Italy as a country but only of a dependency, and therefore it is OK for me as it is now. All the best. Paradise Chronicle (talk) 06:15, 1 September 2019 (UTC)[reply]

Are you playing a Wikipedia dictator?

...by choosing to remove facts about Noto fonts from reliable sources? 89.12.60.198 (talk) 16:47, 19 October 2019 (UTC)[reply]

This must be a rhetorical question that you don't expect me to answer. — BTW, I am looking forward to the reaction of other Noto fonts editors. Love —LiliCharlie (talk) 17:09, 19 October 2019 (UTC)[reply]

ArbCom 2019 election voter message

Hello! Voting in the 2019 Arbitration Committee elections is now open until 23:59 on Monday, 2 December 2019. All eligible users are allowed to vote. Users with alternate accounts may only vote once.

The Arbitration Committee is the panel of editors responsible for conducting the Wikipedia arbitration process. It has the authority to impose binding solutions to disputes between editors, primarily for serious conduct disputes the community has been unable to resolve. This includes the authority to impose site bans, topic bans, editing restrictions, and other measures needed to maintain our editing environment. The arbitration policy describes the Committee's roles and responsibilities in greater detail.

If you wish to participate in the 2019 election, please review the candidates and submit your choices on the voting page. If you no longer wish to receive these messages, you may add {{NoACEMM}} to your user talk page. MediaWiki message delivery (talk) 00:14, 19 November 2019 (UTC)[reply]

Google Code-In 2019 is coming - please mentor some documentation tasks!

Hello,

Google Code-In, Google-organized contest in which the Wikimedia Foundation participates, starts in a few weeks. This contest is about taking high school students into the world of opensource. I'm sending you this message because you recently edited a documentation page at the English Wikipedia.

I would like to ask you to take part in Google Code-In as a mentor. That would mean to prepare at least one task (it can be documentation related, or something else - the other categories are Code, Design, Quality Assurance and Outreach) for the participants, and help the student to complete it. Please sign up at the contest page and send us your Google account address to google-code-in-admins@lists.wikimedia.org, so we can invite you in!

From my own experience, Google Code-In can be fun, you can make several new friends, attract new people to your wiki and make them part of your community.

If you have any questions, please let us know at google-code-in-admins@lists.wikimedia.org.

Thank you!

--User:Martin Urbanec (talk) 21:58, 23 November 2019 (UTC)[reply]

Colognian tone

Hey. Did we ever decide how to transcribe T1 vs T2? E.g. maybe a circumflex for T1 and either just a stress mark or some other tonal diacritic, maybe a grave, for T2, like [zɛ̂i] (T1) vs [zɛ̀i] (T2), or [ˈzɛ̂i] vs [ˈzɛi]. — kwami (talk) 22:26, 12 December 2019 (UTC)[reply]

Good question. Special diacritics for both pitch accents are traditionally used in orthography-based Serbo-Croatian linguistic works (and are conflated with marking vowel and syllabic liquid length), whereas one of the two pitch accents of Swedish-Norwegian is viewed as the unmarked form and traditionally has no special symbol in phonemic transcriptions. I prefer the latter choice, unless someone convinces me that two dedicated pitch-accent symbols are genuinely helpful for our users. Love —LiliCharlie (talk) 01:15, 13 December 2019 (UTC)[reply]

A Joyous Yuletide to you!

Merry Christmas and a Prosperous 2020!

Hello LiliCharlie, may you be surrounded by peace, success and happiness on this seasonal occasion. Spread the WikiLove by wishing another user a Merry Christmas and a Happy New Year, whether it be someone you have had disagreements with in the past, a good friend, or just some random person. Sending you heartfelt and warm greetings for Christmas and New Year 2020.
Happy editing,
JACKINTHEBOX • ^TALK 08:17, 23 December 2019 (UTC)[reply]

Spread the love by adding {{subst:Seasonal Greetings}} to other user talk pages.

ISO 3166-1 and Taiwan

I disagree with your edit summary that citing CNS would be original research. [7] How about moving the long material to ISO 3166-2:TW?--Jusjih (talk) 23:53, 27 January 2020 (UTC)[reply]

@Jusjih: All right, I see you've made the changes to ISO 3166-1, ISO 3166-2:CN, and ISO 3166-2:TW. Strictly speaking however, only the first two of those articles are about Taiwan as an entity, namely ISO 3166-1 country code TW, and ISO 3166-2 country subdivision code CN-TW. Please note that the topic of article ISO 3166-2:TW is not Taiwan as a whole but rather its subdivisions. Also note that the ISO 3166-2 subdivisions TW-KIN designating Kinmen, and TW-LIE designating Lienchiang refer to territories that the PRC does not recognize as being part of their hypothetical Taiwan Province, but part of Fujian Province. In other words the territorial scope of ISO 3166-1 TW is one that only the ROC approves of, whereas the PRC contradicts this way of subdividing Taiwan, no matter what designation ISO 3166-1 code TW is given. Going a step further this means that if you start discussing disputes in ISO 3166-2:TW it would be fair and appropriate to mention that dispute as well. See also Wikipedia:Neutral point of view. Love —LiliCharlie (talk) 12:32, 30 January 2020 (UTC)[reply]

Kinmen and Matsu were omitted from ISO 3166-2:TW as evidenced in ISO 3166-2 Newsletter I-4 (2002-12-10). They were added on 2015-11-27. Added your requested dispute.--Jusjih (talk) 00:38, 31 January 2020 (UTC)[reply]

Carefully read https://www.iso.org/obp/ui/#iso:code:3166:TW to see under "Additional information": "Administrative language(s) alpha-2: zh; Administrative language(s) alpha-3: zho; Local short name: Taiwan". Be bold but not reckless. [8] Want me to upload cropped screenshots of that page to better explain the story? Neutral point of view may mean that all ISO 3166 pages should be politically neutralized.--Jusjih (talk) 04:03, 3 February 2020 (UTC)[reply]

Exactly. It's a local short name, not a name of the international standard ISO 3166 that we are discussing. A textbook case of WP:NOTEVERYTHING. Love —LiliCharlie (talk) 06:12, 3 February 2020 (UTC)[reply]

Then how should we talk at Talk:ISO 3166-1 alpha-2 and Talk:List of ISO 3166 country codes to enforce WP:NPV and WP:NOTEVERYTHING about "common names" not always acknowledged by ISO? List of ISO 3166 country codes overlaps ISO 3166-1 and List of sovereign states matching zh:世界政區索引 with ISO codes as secondary information. It is theoretically possible to merge List of ISO 3166 country codes into List of sovereign states to be more flexible with naming countries and areas, but this requires wide discussion.--Jusjih (talk) 04:24, 5 February 2020 (UTC)[reply]

Tamazight

Hello there! Thank you for your work. I was wondering if you can add a language to the lists, or whether the lists come from other sources. (The language I am talking about is Berber "Tamazight", the native language of North Africa.) Kamaqdouf (talk) 19:33, 29 February 2020 (UTC)[reply]

Hi Kamaqdouf,

there are several languages that carry the name Tamazight in Ethnologue's classification of Berber languages. The one with most reported speakers is Central Atlas Tamazight with about 5 million total speakers. Which of our lists of languages do you mean? I am afraid that neither our list of languages by total number of speakers nor our list of languages by number of native speakers is long enough for Central Atlas Tamazight to be included. Love —LiliCharlie (talk) 23:32, 29 February 2020 (UTC)[reply]

List of languages by number of native speakers (again)

This list should be updated according to Ethnologue (2020, 23rd edition). The new edition has been published. 182.186.63.233 (talk) 04:58, 1 March 2020 (UTC)[reply]

I would also like this to happen. However, most Ethnologue webpages are now behind a paywall, and I only have access to their free What are the top 200 most spoken languages? page with figures for total number of speakers. (After inputting one letter in the search field a list of all top 200 languages whose name starts with that letter is shown, so 26 searches should cover them all.) Love —LiliCharlie (talk) 10:37, 1 March 2020 (UTC)[reply]

Please update according to 23rd edition 2020. 182.186.34.48 (talk) 16:18, 9 March 2020 (UTC)[reply]

My reasons for the transcription of the phoneme ʌ to ɐ and why it is not vandalism

My edit is not vandalism because the transcription of the phoneme ʌ as ɐ was done to provide accurate information that corresponds to the transcription of English in IPA. Although the Lua programming language does not show this, my transcription is valid because the way we normally pronounce words transcribed with the vowel ʌ sounds a lot closer to the vowel ɐ. Another reason why my edit is not vandalism is that the page Help:IPA/English also shows that some dictionaries transcribe ɐ as ʌ, along with other pieces of evidence that ʌ might not be an accurate transcription when it comes to the pronunciation of words like cut, runt, shrug, etc. Love—(User talk:Jarabalistic Woofenmoton746) 19:03, 17 March 2020 (UTC)[reply]

ISO 3166-2 code "GB"

ISO 3166-2: I read the page. Sorry, I put it like the four countries in one in the UK article. The article says 3 nations, which I didn't find at the link; is that somewhere else? 80.233.32.213 (talk) 20:08, 2 May 2020 (UTC)[reply]

No, https://www.iso.org/obp/ui/#iso:code:3166:GB is the only authoritative source for ISO 3166 code GB. Anything that is not mentioned there should not be in article ISO 3166-2. There is a lot more to report on the UK, of course, but that is off topic. — For the controversial status of Northern Ireland as a province, region, country, nation or something else, see Northern Ireland#Descriptions. Love —LiliCharlie (talk) 21:06, 2 May 2020 (UTC)[reply]

Phonetics vs. Phonology

Moved to Talk:Phonetics § Phonetics vs. phonology

Hi. Thanks for your help. This is my concern. I feel that the first sentence should tell us specifically what phonetics is. The average person reading the Phonology page and the Phonetics page would think they are the same thing, based on the first sentences:

"Phonology is a branch of linguistics concerned with the systematic organization of sounds in spoken languages..."

"Phonetics is a branch of linguistics that studies the sounds of human speech..."

They sound the same to me.

I suggest you amend the first sentence using the information in the second paragraph. So, it would start with:

Phonetics is a branch of linguistics that broadly deals with two aspects of human speech: production—the ways humans make sounds—and perception—the way speech is understood. In the case of sign languages, the equivalent aspects of sign would apply.

OR, even better:

Phonetics is a branch of linguistics that broadly deals with two aspects of human speech: the ways humans make speech sounds, and the way speech is understood. In the case of sign languages, the equivalent aspects of sign would apply.

The second paragraph can be changed a little to avoid repetition.

The Phonology page should change too. I suggest:

Phonology is a branch of linguistics concerned with the systematic organization of sounds (i.e. the patterns) in human languages; including phonetics, the ways humans make speech sounds and the way speech is understood. Cheers. John (talk) 20:31, 10 June 2020 (UTC)[reply]

Reversion on Flemish

Hi there,

I see you reverted my edit on Flemish. If you consult File:BelgieGemeenschappenkaart.svg, you will see that the summary notes the color as "gold" and "golden", not as "olive green", which is a completely different color. I would appreciate a compromise that would be suitable; that color does not look olive green to me at all.

Thank you, The Obento Musubi (talk) 21:35, 1 July 2020 (UTC)[reply]

Hi The Obento Musubi, I followed your proposal and consulted File:BelgieGemeenschappenkaart.svg where it says the following web colours are used in the map: #A0A000, #A00000, and #0000A0. Here are a few named web colours for comparison: gold; yellow; olive. (See the list of web colour names at Web colors#X11 color names or the graphic File:SVG Recognized color keyword names.svg.) To my eye #A0A000 and olive look very much alike, except that olive is a bit darker, and quite unlike gold and yellow. How about calling #A0A000 "light olive" and/or using colour boxes as I did here? Love —LiliCharlie (talk) 06:50, 2 July 2020 (UTC)[reply]

Hi LiliCharlie, I really appreciate your adding color boxes. I think that is the most suitable solution. The colors used in the maps were a bit odd. Thank you! The Obento Musubi (talk) 01:16, 19 July 2020 (UTC)[reply]

Hi The Obento Musubi, thanks for showing your appreciation. — On 2 July I also left a message on commons:File talk:BelgieGemeenschappenkaart.svg inviting users to join this discussion, but no one reacted. Do you think we should change the colour name in the English description of the map file? And if so, change it to "light olive" or something else? Love —LiliCharlie (talk) 06:15, 19 July 2020 (UTC)[reply]

Hi there LiliCharlie, I think this may be one of those black/blue or gold/white dress situations. This website describes #A0A000 as "Dark yellow (olive tone)", and this website describes #A0A000 as "Citrus", and this website describes the color as "a medium dark shade of yellow-green". The really difficult thing about this color (see this website) is that it seems that it's 50% red, 50% green, and 0% blue as far as RGB percentages (which probably explains why you see it as a shade of olive green), but by CMYK percentages, the color is 100% yellow + 37% black, which is probably why I saw it as a muddled gold color. In any case, the color is not a web-safe color, so I think, ideally, the original file should be modified to be a web-safe color; that way, the color would be easily describable. The Obento Musubi (talk) 19:39, 23 July 2020 (UTC)[reply]
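The percentages quoted in this thread can be checked with a few lines of Python. This is only a minimal sketch: it assumes the naive, device-independent RGB to CMYK formula (real print workflows use colour profiles), it reads the "RGB percentages" as each channel's share of the channel sum, and all function names are hypothetical helpers made up for the illustration.

```python
def hex_to_rgb(hex_code):
    """Parse '#RRGGBB' into a tuple of three 0-255 integers."""
    h = hex_code.lstrip("#")
    return tuple(int(h[i:i + 2], 16) for i in (0, 2, 4))


def rgb_shares(rgb):
    """Each channel as a rounded percentage of the sum of all channels."""
    total = sum(rgb)
    if total == 0:
        return (0, 0, 0)
    return tuple(round(100 * c / total) for c in rgb)


def rgb_to_cmyk(rgb):
    """Naive RGB -> CMYK conversion (assumption: no colour profile), in percent."""
    r, g, b = (c / 255 for c in rgb)
    k = 1 - max(r, g, b)
    if k == 1:  # pure black: avoid dividing by zero
        return (0, 0, 0, 100)
    c = (1 - r - k) / (1 - k)
    m = (1 - g - k) / (1 - k)
    y = (1 - b - k) / (1 - k)
    return tuple(round(100 * x) for x in (c, m, y, k))


def is_web_safe(rgb):
    """Web-safe colours only use the channel values 00, 33, 66, 99, CC, FF."""
    return all(c % 0x33 == 0 for c in rgb)


def nearest_web_safe(rgb):
    """Round every channel to the nearest multiple of 0x33."""
    return "#{:02X}{:02X}{:02X}".format(*(round(c / 0x33) * 0x33 for c in rgb))


map_colour = hex_to_rgb("#A0A000")
print(rgb_shares(map_colour))        # (50, 50, 0)
print(rgb_to_cmyk(map_colour))       # (0, 0, 100, 37)
print(is_web_safe(map_colour))       # False
print(nearest_web_safe(map_colour))  # #999900
```

Under these assumptions the same colour is an even red/green mix in RGB terms and pure yellow plus black in CMYK terms, which is consistent with the two readings ("olive" versus "muddled gold") described above.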

As long as the colours are distinct and a matching key is provided the purpose of the diagram is (in my opinion) achieved. Of course there is the issue of device ability to render colours and personal perception of colours when it comes to naming. Personally I would see Brown, Blue and Green - but does it matter in this context? 109.180.186.210 (talk) 10:21, 3 December 2021 (UTC)[reply]

World languages

Good afternoon Lili -

I am puzzled (and intrigued) by your statement: [World Language is] "an article on how the term world language has been used in academic literature". It does of course make it clear that I was quite wrong to write: The key question here is: "Which languages should be listed as having world status, distinguishing them from just the 'supra-regional' ones?"

However, I am not the only editor with the impression that an essential feature of the article - is to present examples of languages with [what could be described as] "World status", followed by others which "do not quite make the grade". You must have noticed that Taryan recently promoted Hindi/Urdu into the second category.

Frankly, I am not personally bothered by the fact that Spanish has been pushed down into the second category. It is true that it is (now) my second language, but for most of my life it was not. You may recall that, not too long ago, I intervened quite forcibly to retain French in second position above Spanish.

Thus, I was only trying to be helpful when I posted: Spanish seems to feature well in two portions of the globe ... so I am reluctant to 'demote' [it]. It is misleading and unfair to say: DLMcN thinks that "Spanish is indisputably dominant in the western hemisphere (and thus is widely taught in the USA, Brazil, and in non-Hispanic islands of the Caribbean), and is important in western Europe, etc. etc." - because < these are facts which are so obvious that it seems superfluous to insist on sources to back them up.

Asking an admin to ban all further discussion of this topic, is surely not the best way of resolving the matter? Rather, it is going to need a complete restructuring and rewrite of the article - to reflect your description of it (quoted at the beginning of this post).

With regards, --DLMcN (talk) 16:34, 1 August 2020 (UTC)[reply]

Hi DLMcN, two of Wikipedia's principles are: Do not combine material from multiple sources to reach or imply a conclusion not explicitly stated by any of the sources. And: Even if you are sure something is true, it must be verifiable before you can add it. It is therefore pointless to discuss if some claim is true or not. Our task is not to discover "the truth," but to report what reliable sources explicitly state, without synthesis of published material (i.e., without drawing conclusions from other facts or claims). — Besides, we don't rank languages in article World language. All we say is that there is a group of languages that are called world languages by the authors of nearly all reliable sources, another group by many authors, and yet another group by some authors. And we don't say that any of those author(itie)s is better or worse than their colleagues, either. — If I was an acknowledged expert I would certainly count Spanish among the world languages. But that doesn't matter here. Love —LiliCharlie (talk) 17:51, 1 August 2020 (UTC)[reply]

Thank you Lili ... After a short exchange of views with IamNotU - see https://en.wikipedia.org/wiki/User_talk:IamNotU#Another_James_Oredan_sockpuppet?_-_or_not? > no. 87 - [admittedly starting with a completely different topic] - I suggest that we should perhaps not have three separate sections, but simply run the whole list through without any dividing lines. We could maintain the present ordering - but emphasise that the 'ranking' presented, was only approximate - i.e., not meant to be taken too literally. Selected comments after each language [based on sources if necessary] would then give some idea of how widely they were spoken, taught and used. I mentioned your name because I thought you would probably have access to references 1, 2, 29 and 30. Cheers --DLMcN (talk) 20:18, 25 August 2020 (UTC)[reply]

Noto IPA

Hi. Another slight problem in e.g. ⟨ᶜ̧ ᵊ˞ ⟩, the modifier letter is composed of finer lines than usual, while the diacritics have normal thickness, which looks odd when they intersect. (It also looks odd with diacritics that don't intersect, but less obviously.) But that's really only a problem if you wanted to use Noto for printing, which it's not really designed for. For the internet it's good enough. — kwami (talk) 10:21, 22 August 2020 (UTC)[reply]

Spot-on

You are absolutely right RE: abugida. This was based on a misreading on my part. Anti-Anti-Vaxxer2 (talk) 14:55, 28 August 2020 (UTC)[reply]

A barnstar for you!

	The Original Barnstar
	Spotted an error, repaired it speedily Anti-Anti-Vaxxer2 (talk) 14:57, 28 August 2020 (UTC)[reply]

Wording on Han unification

Hi @LiliCharlie: I absolutely agree with what you said when you reverted my edit. Though, what I had meant was that the significantly-different characters are the simplified and Japanese ones, while the compatibility ideographs were "similar variants". Unfortunately, I did not word that clearly and I am not very familiar with Unicode (thanks for linking to the relevant pages!), so it seems that came across wrong. If you know how to properly explain what I was trying to say, that might be helpful, or it could just be left off. I'll switch the character in the traditional column of the chart back to the traditional version since I think that edit is appropriate (do correct me if I'm wrong though) and leave the paragraph as is. Thanks again, ChromeGames923 (talk · contribs) 21:47, 25 October 2020 (UTC)[reply]

Hi ChromeGames923, the characters U+F908 龜 and U+F907 龜 were encoded for what is known as "round-trip compatibility" with existing encoding schemes, so that any text can be converted from other encodings to Unicode and converted back to the original encoding without loss. However, in Unicode itself they are considered equivalent to U+9F9C 龜 rather than "similar variants". This means, among other things, that fonts and renderers are not required to distinguish between them. To give you an idea, the following fonts on my system have three identical glyphs for them: AR PL New Kai, BabelStone Han, Batang, BatangChe, Code2000, DFHKStdKai-B5, DFPHKStdSong-B5, DFKai-SB, Dotum, DotumChe, Gulim, GulimChe, Gungsuh, GungsuhChe, HanaMinA, Han-Nom Minh, HYHaeseo, Kurinto fonts in 44 styles, Malgun Gothic, Meiryo, Meiryo UI, MOESungUN, New Batang, New Gulim, TH-Jeong-H, TH-Jeong-J, TH-Ming-HP0, Yu Gothic, and Yu Gothic UI. As you can see, this listing includes fonts from (and for) six major locales: Mainland China, Taiwan, Hong Kong, Japan, South Korea, and Vietnam. (And it seems our Wikimedia software converts U+F908 龜 and U+F907 龜 to U+9F9C 龜, which is no problem for characters that are not the result of conversion from other encodings. Or is it my browser that does the conversion?) — You simply cannot expect the three to be visually different without language mark-up, and the necessity for such mark-up is already discussed in the article, and amply exemplified in the chart of section "Examples of language-dependent glyphs". — The simplified characters U+9F9F 龟 (Mainland China, Singapore, Malaysia) and U+4E80 亀 (Japan) are already in the chart of the "Examples of some non-unified Han ideographs" section. Love —LiliCharlie (talk) 23:06, 25 October 2020 (UTC)[reply]
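The equivalence described above can be observed with Python's standard unicodedata module. This is a minimal sketch: the helper name normalized_forms is hypothetical, and the code says nothing about which piece of software (wiki or browser) performs the conversion mentioned in the comment. It only shows that all four standard normalization forms map the two compatibility ideographs to the unified one.

```python
import unicodedata

UNIFIED = "\u9F9C"             # the unified ideograph
COMPAT = ["\uF907", "\uF908"]  # the two compatibility ideographs discussed above


def normalized_forms(ch):
    """Return the result of each standard normalization form for one character."""
    return {form: unicodedata.normalize(form, ch)
            for form in ("NFC", "NFD", "NFKC", "NFKD")}


for ch in COMPAT:
    forms = normalized_forms(ch)
    # Print code points rather than glyphs, since the glyphs may look identical.
    print(f"U+{ord(ch):04X}", {name: f"U+{ord(s):04X}" for name, s in forms.items()})
```

Because even NFC replaces these characters, any pipeline that normalizes text on input would store U+9F9C, and the distinction only survives in text that is never normalized.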

CSS-generated uppercase ẞ

At Talk:ß#Decisive_arguments, you write “... this CSS-generated ß looks like two S's ... and you won't be able to highlight one of the S's individually”. That is true on my computer, too. (Although it displays it as if it highlighted first the first S, and then the other with its adjacent space). But where is that defined? I searched those listed in Help:Cascading Style Sheets that I thought might apply (i. e. common/shared.css, monobook/main.css, MediaWiki:Common.css, MediaWiki:Monobook.css), but can't see it. Another thing I found odd but might understand if I had access to the CSS is that the ß copies as a simple uppercase “S” via clipboard. ◅ Sebastian 23:36, 3 November 2020 (UTC)[reply]

The CSS-3 specification says: "The ‘text-transform’ property only affects the presentation layer; correct casing for semantic purposes is expected to be represented in the source document." Among other things this means it affects only how characters display, not their number or identity. (BTW, my Firefox browser copies ß as lowercase ß to the clipboard, i.e. it also preserves "correct casing for semantic purposes ... in the source document.") The specification stipulates a little lower: "The UA must use the full case mappings for Unicode characters", which means that ß must be transformed to SS. By Wikipedia standards, this is probably WP:SYNTHESIS, but the only interpretation is that the result of the transformation must be a single-character ß. — The CSS property "text-transform" only affecting the presentation layer but not the number of characters is essential for letter spacing: Straße transformed to uppercase with a letter spacing of 0.3em yields Straße; there's an increased spacing between six characters rather than seven. Love —LiliCharlie (talk) 01:46, 4 November 2020 (UTC)[reply]
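The difference between the source text and its full-case-mapped form can be illustrated outside a browser. The sketch below uses Python's str.upper(), which also applies Unicode full case mappings; it is not CSS and does not model the presentation layer, so it only shows what the mapping itself does to the string.

```python
source = "Straße"
mapped = source.upper()  # Unicode full case mapping turns ß into SS

print(mapped)                    # STRASSE
print(len(source), len(mapped))  # 6 7

capital_eszett = "\u1E9E"              # LATIN CAPITAL LETTER SHARP S
print(capital_eszett.lower() == "ß")   # True: the capital form lowercases to ß
print("ß".upper() == capital_eszett)   # False: uppercasing ß never yields U+1E9E
```

The source string keeps its six characters, which matches the point made above that text-transform changes only how the text is displayed, not the number or identity of the characters in the document.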

Thank you for your detailed and interesting reply. All I wanted to know, though, was: Which concrete CSS file drives that behavior in our case? Sorry for your extra work; my parenthetical remarks probably contributed to the misunderstanding by making the question look bigger than it was intended. ◅ Sebastian 14:05, 4 November 2020 (UTC)[reply]

If that's what you're asking: The CSS code is the text-transform:uppercase; in the code '''ß''' which is similar to the example given in the lead of article Span and div (where the CSS code is color: red;). The files you are referring to define styles that apply to several or all Wiki pages a user views, but in this case I wanted my code to apply only to a few characters on one Wikipedia page in a way that doesn't require readers to edit their user-defined CSS files. Love —LiliCharlie (talk) 22:48, 4 November 2020 (UTC)[reply]

Oh, thank you! I expected too much of an explicit definition; it didn't occur to me that that specific behavior is already part of the standard behavior for text-transform:uppercase;. ◅ Sebastian 16:04, 12 November 2020 (UTC)[reply]

BTW, i just came across this character, which i might have added to my answer: 囧. ◅ Sebastian 10:35, 13 November 2020 (UTC)[reply]

That's a nice one. — Due to Han unification, there is a difference in design between the simplified Chinese character 囧 (generated using 囧) and the traditional 囧 (囧). What you see without language markup depends on your default computer/browser settings. Love —LiliCharlie (talk) 11:43, 13 November 2020 (UTC)[reply]

Phonetics

In Italian, ŋ is a separate sound that is under "N" but it is constant and part of our phonetics, so it is a constant sound when it ends with "N" and starts with "G" or "C (K)". I don't understand, it's not a dialectal difference, nationwide it is pronounced like that. --Egosivexisto (talk) 20:37, 8 November 2020 (UTC)[reply]

Yes. More precisely, the phone [ŋ] (velar nasal) is the allophone of the phoneme /n/ that occurs before velar consonants. In other words: There is a rule when /n/ is pronounced [ŋ], and you mention that rule in a non-technical way, but since [n] and [ŋ] cannot occur in the same environments (before the same consonants) they never contrast to distinguish words or phrases, and are not separate phonemes. Therefore, a table of phonemes is not the right place for the Italian velar nasal. (Similarly, the phone [ɱ] (labiodental nasal) only occurs before labiodental consonants. The general phonological rule is that Italian sequences of nasal+consonant are homorganic, as in Spanish, but unlike in English and German.) Love —LiliCharlie (talk) 22:04, 8 November 2020 (UTC)[reply]
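The rule stated above can be written down as a small Python sketch. It is a deliberate simplification: words are given as plain lists of segments, only /n/ is rewritten, the consonant sets are abbreviated, and the function name realize_nasals is hypothetical.

```python
VELARS = {"k", "g"}        # simplified set of velar consonants
LABIODENTALS = {"f", "v"}  # simplified set of labiodental consonants


def realize_nasals(phonemes):
    """Map a phonemic segment list to phones by assimilating /n/ to what follows."""
    phones = []
    for i, segment in enumerate(phonemes):
        following = phonemes[i + 1] if i + 1 < len(phonemes) else None
        if segment == "n" and following in VELARS:
            phones.append("ŋ")  # velar allophone of /n/
        elif segment == "n" and following in LABIODENTALS:
            phones.append("ɱ")  # labiodental allophone of /n/
        else:
            phones.append(segment)
    return phones


print("".join(realize_nasals(list("banka"))))   # baŋka  (banca)
print("".join(realize_nasals(list("anfora"))))  # aɱfora (anfora)
print("".join(realize_nasals(list("nano"))))    # nano   (no change)
```

Since the output phone is fully determined by the following consonant, [n] and [ŋ] never appear in the same environment, which is why the sketch needs only one nasal phoneme as input.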

Shuowen Jiezi radicals

Hi colleague, this is the first time I have opened a discussion in a foreign Wikipedia, I'm from the Italian Wikipedia! I took the most conservative versions (Small Seal) from the Shuowen Jiezi and a dictionary of Chữ Nôm with Chinese character variants, https://www.zdic.net/ and https://hvdic.thivien.net/. If you don't like 示, feel free to change it. I spent more or less two weeks doing the reconstruction of all 540 characters (that is to say, I explained their origin and found the correct version. I also did the whole reconstruction of the current version of the HSK4). Sorry if I didn't put the sources, this is not random, I put a lot of effort into this work :3 Cicognac (talk) 23:53, 15 November 2020 (UTC)[reply]

Hi Cicognac, many thanks for coming here to improve our List of Shuowen Jiezi radicals.

The best source for Han character correspondences of seal script characters that I know is the electronic Wenlin Dictionary. It contains all Shuowen characters and their original definitions in seal script (in vector format, created by the people who also created the Character Description Language) along with the same content on two levels of Han characters: Older ones that are usually closer approximations of the original seal script, and modern ones.

If you like we can make a new list that contains alternative Han characters instead of only one, perhaps in a sandbox like List of Shuowen Jiezi radicals/sandbox, and then paste it into the article. user:Yug from the Ancient Chinese characters project and user:Dimitrius from the Russian Wikipedia (article ru:Список ключей Шовэнь цзецзы) are also interested in this topic and perhaps willing to help us. When we are done the table could serve us as the basis for articles in our "native" Wikipedias (in my case, the German Wikipedia).

P.S.: I am the person who created the illustrations file:The 540 Shuowen Seal Radicals.svg and file:The 540 Shuowen Seal Radicals - numbered.svg. Love —LiliCharlie (talk) 01:17, 16 November 2020 (UTC)[reply]

MPF, I can't find the Shuowen Jiezi dictionary in the website you gave me. Where did you retrieve it from, exactly? I would like to put the original version, the most conservative version in the list since they ARE the Shuowen radicals (I'm not interested in other versions than the Small Seal for this topic). What we use today are either traditional or simplified characters and, while doing philology, they are quite useless: if you can show to the reader a conservative version and/or the original version, that's better. The same applies to the Shuowen radicals. That's why I put all those corrections in the Italian version of the article, https://it.wikipedia.org/wiki/Radicali_dello_Shuowen_Jiezi. It's 100% mine, many of the longest articles in Wiki.it are mine. The longest article in Wiki.it was the reconstruction of the HSK4, which is now split in 5 parts. As for this handful of corrections, if they're not enough and you know a better source with variants, we could use that to add more corrections. As I told you before, I found a lot of variants in the Han Tu Tu Dien and I compared them to the Small Seal version in this dictionary and in the Shuowen Jiezi definition in zdic.net. There shouldn't be many corrections missing, so you can simply copy the existing version and put the corrections. What is your best source, exactly? Can you copy and paste the exact link to the page? (I'll only use free dictionaries at the moment). To conclude, if you think that my version is quite ok, you can copy and paste from my version, it will take you 10 minutes more or less. I've already done the whole research. I'm waiting for your source. Thank you! Cicognac (talk) 09:47, 16 November 2020 (UTC)[reply]

The Wenlin Dictionary is neither online nor free; it is a computer programme, and it is not exactly cheap. — I have decided to produce a PDF in searchable PDF/A-3b archive format that I'll upload to the Commons, but this will take some time because I intend to build a dedicated Shuowen radical font for that, with the seal glyphs in the Private Use Area and the Han characters in a homogeneous style. (I'm not sure if all the outdated Han characters needed are already encoded in Unicode. Though the dictionary uses Unicode where possible, it doesn't depend on it and contains non-Unicode Chinese characters in Character Description Language format.) — The PDF will be expandable, i.e. it will be possible to add Han characters from other sources, and then upload revised versions of the document to the Commons. Love —LiliCharlie (talk) 04:14, 17 November 2020 (UTC)[reply]

What if we wait for the Unicode creation and standardization of the Shuowen Jiezi characters? They started a year ago, it shouldn't take them too much time, after they have finished we can download the font and use it (they're also standardizing the oracle bones characters). Otherwise, I can wait for your personal standardization. I can buy stuff, but I think I'll buy a nice dictionary of 字源, I am not so willing to buy the Wenlin Dictionary. It doesn't bother me, at the moment I'm not dealing with them and I can wait, in the meantime I can work on many other possible topics (i.e. the Sino-xenic pronunciation), other projects and have a break (I worked non-stop for 2 months). No, I think some characters are not encoded. I did my best with the available variants, for now I think it's ok. If you don't like my version of the radicals, it doesn't matter. Now that I know that Unicode is creating a font for these characters, I/we can have a look from time to time. This is good news for those who work with these characters. If need be, feel free to start a conversation on my talk page in the Italian version, foreigners are always welcome! Cicognac (talk) 22:59, 17 November 2020 (UTC)[reply]

Unicode did not start to discuss the encoding of seal script characters a year ago. This goes back to 2003, see their page Topical Document List: Seal Script. (I recommend viewing the proposal and charts of 2019-06-19, as well as the later documents. Richard Cook who repeatedly contributed to that discussion is one of the authors of the Wenlin Dictionary, as well as one of the two creators of the Character Description Language.) Also note that seal script hasn't made it to the Unicode pipeline yet, and that the Unicode Consortium does not provide fonts for public use — Unicode doesn't encode glyphs, but characters —, and that wide-spread font support (esp. in OS distributions) may take a decade or so to take effect.

I am preparing the PDF because I don't expect you to have access to the Wenlin dictionary. It will inform you of its content, and enable you to add more Han characters that correspond to seal script Shuowen radicals. Love —LiliCharlie (talk) 20:25, 19 November 2020 (UTC)[reply]

ArbCom 2020 Elections voter message

Hello! Voting in the 2020 Arbitration Committee elections is now open until 23:59 (UTC) on Monday, 7 December 2020. All eligible users are allowed to vote. Users with alternate accounts may only vote once.

The Arbitration Committee is the panel of editors responsible for conducting the Wikipedia arbitration process. It has the authority to impose binding solutions to disputes between editors, primarily for serious conduct disputes the community has been unable to resolve. This includes the authority to impose site bans, topic bans, editing restrictions, and other measures needed to maintain our editing environment. The arbitration policy describes the Committee's roles and responsibilities in greater detail.

If you wish to participate in the 2020 election, please review the candidates and submit your choices on the voting page. If you no longer wish to receive these messages, you may add {{NoACEMM}} to your user talk page. MediaWiki message delivery (talk) 02:31, 24 November 2020 (UTC)[reply]

Shuowen Jiezi last part of 2nd revision

Hey, LiliCharlie, did something happen in the last 2 weeks or so? You didn't finish the last part of the revision of the Radicals chart, there is a handful of addenda missing, then the chart can be posted for everyone's use! At the moment, my internet connection is working (I have had problems in the last few days), so I can try to contact you. Cicognac (talk) 09:35, 15 December 2020 (UTC)[reply]

A Joyous Yuletide to You!

Merry Christmas and a Prosperous 2021!

Hello LiliCharlie, may you be surrounded by peace, success and happiness on this seasonal occasion. Spread the WikiLove by wishing another user a Merry Christmas and a Happy New Year, whether it be someone you have had disagreements with in the past, a good friend, or just some random person. Sending you heartfelt and warm greetings for Christmas and New Year 2021.
Happy editing,
JACKINTHEBOX • ^TALK 16:18, 24 December 2020 (UTC)[reply]

Spread the love by adding {{subst:Seasonal Greetings}} to other user talk pages.

Hangul

"All basic letters outline their articulator's shape and phonetic features when pronouncing them" is more concise than the current version and has the same meaning. I don't think your first comment hits the big picture of my changes. "it takes TWO articulators to produce an articulatory stricture (usu. an active and a passive one, or two active ones in the case of glottals and, to some degree at least, bilabials as well" I can buy the second comment. — Preceding unsigned comment added by Luminans (talk • contribs) 04:13, 27 December 2020 (UTC)[reply]

National language

I did not add spam, I just fixed the error. See: https://en.wikipedia.org/w/index.php?title=National_language&diff=prev&oldid=1003425597&diffmode=source Someone added the link before me. Please read previous edits before saying it is spam. Mistah B (talk) 23:12, 28 January 2021 (UTC)[reply]

OMG, I reverted to the wrong version using Twinkle (not manually). It seems I didn't realise you had just made the edit I intended to make, and I confused the left side of my screen (showing the spam link) with the right one (link removed). Thanks for correcting my mistake. Love —LiliCharlie (talk) 23:59, 28 January 2021 (UTC)[reply]

Oromo language speaker

Moved to Talk:List of languages by number of native speakers § Oromo language speaker

Hi LiliCharlie,

I have seen that you reverted the change to the Oromo language. According to Ethnologue, Oromo has 34 million speakers, which is updated on the Oromo language page. Southern, western, central and eastern Oromo speakers all live in the single Oromia state of Ethiopia. Oromo is one language; despite slight dialect differences among some of the 21 administrative zones of Oromia, the dialects are all mutually intelligible. The 2011 census reported the population of Oromia as 35,000,000; this makes it the largest regional state in population. Oromia is also the largest regional state and the world's forty-second most populous subnational entity, and the most populous subnational entity in all of Africa. The following sources give 34 million Oromo speakers: Oromo at Ethnologue (18th ed., 2015); Borana–Arsi–Guji–Wallaggaa-Shawaa Oromo at Ethnologue (18th ed., 2015); Eastern Oromo at Ethnologue (18th ed., 2015); Orma at Ethnologue (18th ed., 2015); West Central Oromo at Ethnologue (18th ed., 2015); Waata at Ethnologue (18th ed., 2015). MfactDr (talk) 09:08, 16 February 2021 (UTC)[reply]

A little bit of background on the language: With 33.8% Oromo speakers, followed by 29.3% Amharic speakers, Oromo is the most widely spoken language in Ethiopia.^[1] It is also the most widely spoken Cushitic language and the fourth-most widely spoken language of Africa, after Arabic, Hausa and Swahili.^[2] Forms of Oromo are spoken as a first language by more than 35 million Oromo people in Ethiopia and by an additional half-million in parts of northern and eastern Kenya.^[3] It is also spoken by smaller numbers of emigrants in other African countries such as South Africa, Libya, Egypt and Sudan.

Oromo serves as one of the official working languages of Ethiopia^[4] and is also the working language of several of the states within the Ethiopian federal system including Oromia,^[1] Harari and Dire Dawa regional states and of the Oromia Zone in the Amhara Region. It is a language of primary education in Oromia, Harari, Dire Dawa, Benishangul-Gumuz and Addis Ababa and of the Oromia Zone in the Amhara Region. It is used as an internet language for federal websites along with Tigrinya.^[5]^[6] Under Haile Selassie's regime, Oromo was banned in education, in conversation, and in administrative matters.^[7]^[8]^[9] Thanks. MfactDr (talk)

References

^ "The world factbook". cia.gov.
^ "Children's books breathe new life into Oromo language". bbc.co.uk.
^ "Archived copy". Archived from the original on 2016-08-25. Retrieved 2016-08-22.
^ Shaban, Abdurahman. "One to five: Ethiopia gets four new federal working languages". Africa News.
^ http://www.mcit.gov.et/.
^ "ቤት | FMOH". www.moh.gov.et.
^ Oromo children's books keep once-banned Ethiopian language alive, retrieved February 14, 2016
^ Language & Culture (PDF)
^ ETHIOPIANS: AMHARA AND OROMO, January 2017

Persian speaker total

Moved to Talk:List of languages by total number of speakers § Persian speaker total

Hello there, I think u’ve made a mistake about total number of speakers. The page has ignored people with Persian language as their second language and it’s kinda disrespectful to ignore those people. Total number of Persian speaking ppl is around 110 million and u can find the references under my edit of the page. So please go read the references before reverting the page to the 55 million version. By considering the Persian-speaking population as only 55 million, you are ignoring Iranian Kurds, Iranian Turkmens, Iranian Arabs, Iranian Baluchs, Iranian Azeris, Iranian Armenians, Iranian Georgians, Iranian Lurs, Iranian Bakhtiaris etc. All of these ethnicities speak Persian as their second language, which is not considered by Wikipedia and which may bring rage and hate against Wikipedia. There are also other people in Afghanistan, Tajikistan, Pakistan, Uzbekistan, Iraq, Bahrain etc. who speak Persian as their second language. We don’t care if the unified source is complete or not. We just care about being considered in the numbers. So please respect us and consider us as people with their second language as Persian.

From a Bakhtiari minority in Iran. — Preceding unsigned comment added by 5.115.18.69 (talk • contribs) 22:59, 19 February 2021 (UTC)[reply]

Long s

I've finished the spring clean. I'd appreciate a read through to see if I have missed anything or broken anything, if you have time? --John Maynard Friedman (talk) 20:50, 24 February 2021 (UTC)[reply]

Hi John, I'll've time for that tomorrow, I hope, but I'm afraid you've done so good a job I won't be able to find anything. Love —LiliCharlie (talk) 22:05, 24 February 2021 (UTC)[reply]

Thank you. My big weakness is closing brackets and such. Also, I know that I haven't been consistent with s versus 's' or even "s". And maybe I should actually have used ⟨s⟩ (but ⟨long s⟩ is not correct, so perhaps not). "Your mission, if you choose to accept it..." etc etc. --John Maynard Friedman (talk) 00:15, 25 February 2021 (UTC)[reply]

At the risk of rehashing an old argument... where was the consensus formed to use {{char}}? The TfD was closed as kept but that was precisely so that the use of the template, not the template per se, could be hashed out elsewhere. Did that happen? Nardog (talk) 08:32, 25 February 2021 (UTC)[reply]

As far as I know, I limited my use of {{char}} to just those cases where the character being discussed needs to be isolated for that purpose.

In this case, we have an unusual and unfamiliar symbol that certainly needs to be mounted on a microscope slide, as it were. (I see no similar reservations about {{code}}, which is in widespread use despite its rendering the symbol(s) in monospace and so distorting them.) I have used {{angbr}} for graphemes and (more controversially) digraphs. If there are any cases where angbr is more appropriate, I have no objection to that change. I am far more concerned about unhelpful markup like parentheses, quote marks and (far worse) italics. --John Maynard Friedman (talk) 09:49, 25 February 2021 (UTC)[reply]

Rereading the article this morning as a final copy-edit, afaics {{char}} appears exactly twice, once with a long s and once with a round s. Short of outright opposition to the template being used in any context, I don't really understand your point? All it does is draw a one-pixel square around the symbol under examination. It doesn't intrude. It is invisible to screen readers without affecting their function. It doesn't affect people with visual impairment using magnifiers like italics do. --John Maynard Friedman (talk) 13:55, 25 February 2021 (UTC)[reply]

Double marks

Just FYI, I plan to write updates to bracket etc this evening to describe the precomposed double brackets etc. Unless you do it first of course! --John Maynard Friedman (talk) 12:16, 28 February 2021 (UTC)[reply]

RfC notice

This is a neutral notice sent to all non-bot/non-blocked registered users who edited Wikipedia talk:WikiProject Linguistics in the past year that there is a new request for comment at Wikipedia talk:WikiProject Linguistics § RfC: Where should so-called voiceless approximants be covered?. Nardog (talk) 10:54, 27 July 2021 (UTC)[reply]

ArbCom 2021 Elections voter message

Hello! Voting in the 2021 Arbitration Committee elections is now open until 23:59 (UTC) on Monday, 6 December 2021. All eligible users are allowed to vote. Users with alternate accounts may only vote once.

The Arbitration Committee is the panel of editors responsible for conducting the Wikipedia arbitration process. It has the authority to impose binding solutions to disputes between editors, primarily for serious conduct disputes the community has been unable to resolve. This includes the authority to impose site bans, topic bans, editing restrictions, and other measures needed to maintain our editing environment. The arbitration policy describes the Committee's roles and responsibilities in greater detail.

If you wish to participate in the 2021 election, please review the candidates and submit your choices on the voting page. If you no longer wish to receive these messages, you may add {{NoACEMM}} to your user talk page. MediaWiki message delivery (talk) 00:28, 23 November 2021 (UTC)[reply]

Kate Hepburn's accent

I read the article that deals with this accent, but I still didn't understand whether the accent is chiefly British or American. Listening to Katharine Hepburn convinced me that it is predominantly British, to say nothing of Billie Burke. I'm a novice in phonetics, so could you help me answer this question? Роман Сергеевич Сидоров (talk) 07:54, 23 October 2022 (UTC)[reply]

ArbCom 2022 Elections voter message

Hello! Voting in the 2022 Arbitration Committee elections is now open until 23:59 (UTC) on Monday, 12 December 2022. All eligible users are allowed to vote. Users with alternate accounts may only vote once.

The Arbitration Committee is the panel of editors responsible for conducting the Wikipedia arbitration process. It has the authority to impose binding solutions to disputes between editors, primarily for serious conduct disputes the community has been unable to resolve. This includes the authority to impose site bans, topic bans, editing restrictions, and other measures needed to maintain our editing environment. The arbitration policy describes the Committee's roles and responsibilities in greater detail.

If you wish to participate in the 2022 election, please review the candidates and submit your choices on the voting page. If you no longer wish to receive these messages, you may add {{NoACEMM}} to your user talk page. MediaWiki message delivery (talk) 01:12, 29 November 2022 (UTC)[reply]

[1] https://upload.wikimedia.org/wikipedia/commons/5/58/Karte_des_Deutschen_Reichs_%28Vogel%29_-_Section_22_-_Strassburg.png

[The_world_factbook-2] "The world factbook". cia.gov.

[3] "Children's books breathe new life into Oromo language". bbc.co.uk.

[4] "Archived copy". Archived from the original on 2016-08-25. Retrieved 2016-08-22.

[africanews.com-5] Shaban, Abdurahman. "One to five: Ethiopia gets four new federal working languages". Africa News.

[6] http://www.mcit.gov.et/.

[7] "ቤት | FMOH". www.moh.gov.et.

[8] Oromo children's books keep once-banned Ethiopian language alive, retrieved February 14, 2016

[9] Language & Culture (PDF)

[10] ETHIOPIANS: AMHARA AND OROMO, January 2017
