Welcome to Wikipedia, Marmzok! I am Marek69 and have been editing Wikipedia for quite some time. I just wanted to say hi and welcome you to Wikipedia! If you have any questions, feel free to leave me a message on my talk page or by typing {{helpme}} at the bottom of this page. I love to help new users, so don't be afraid to leave a message! I hope you like the place and decide to stay. Here are some pages that you might find helpful:
I hope you enjoy editing here and being a Wikipedian! Oh yeah, I almost forgot, when you post on talk pages you should sign your name on talk pages using four tildes (~~~~); that should automatically produce your username and the date after your post.
If you need help, check out Wikipedia:Questions, ask me on my talk page, or place {{helpme}} on your talk page and ask your question there. Again, welcome!
Hi Marmzo, I've moved the test page to ckb:User:Xqt/Test of cause it is easier to use for me. I've made first decoding test with the arabic letters. Please have a look to this and give me a report whether this is what you expected. As I am not so familar with arabic transliteration please give me the remainig number letters between the digits 0 and 9 to implement the remainings. Regards --Xqt (talk) 17:24, 11 January 2010 (UTC)[reply]
Hi Xqt; I checked and all of them are correct. And about the numbers:
0 - ٠
1 - ١
2 - ٢
3 - ٣
4 - ٤
5 - ٥
6 - ٦
7 - ٧
8 - ٨
9 - ٩
There are 3 points:
We tried it before (correct the letter in content) with a not-professional BOT, but there was a problem: BOT changed all the not-standard letters in article content's; but there are some words, in every article, like interwiki link (link to articles in other languages) that use letters we want to change, and it make them dead link. That's why we stopped that BOT, and I made request here. The same thing for numbers. An example: we have a link in an article in ckbwiki; to 2006's article in english wiki. BOT (no matter which one) changes it to "٢٠٠٦", and its a link to no-where, a dead link.
Maybe we should limit the BOT and don’t let it make changes in language links(if possible).
We need to correct the same letters in articles’ name (I mean page’s names).
For some reason, some people still use not-standard keyboard (layout), so they search web, or they try reach pages and articles with a word contains not-standard letters. I think we should create redirect page to some pages (pages which their article-name contain letter may cause mistake). An example: correct page-name (with correct letters) هەژار, and we should have pages redirected to it by incorrect page-name like ههژار and ههژار. Or maybe you have better suggestion! Best regards --Marmzok (talk) 20:17, 11 January 2010 (UTC)[reply]
I have excluded the following text parts from beeing changed by this manner:
Please verify the behavior on the test page here. This means that there are no changes inside thes textparts like interwiki links, normal wiki links (this also matches categories and images), hyperlinks (but it changes the description) and so on. Templates also depends on magic words. But sorry, it doesn't change piped links like [[link|description]] yet. According the redirects, I am fine with. On the other hand I guess mw fulltext search would find the right page even it is miss-spelled. If this tests are ok we should go to the real pages. Just put some wiki-links to the given test page of those page you want to be changed and verify after bots work. --Xqt (talk) 06:08, 12 January 2010 (UTC)[reply]
Hi Xqt; I checked it and it is exactly what I want. I putted some wiki-links into test page, and you can run test on them. We have two more points:
We have to change this numbers, too (Persian Num. to Kurdish Num.)
(wrong=>correct)
۰ => ٠
۱ => ١
۲ => ٢
۳ => ٣
۴ => ٤
۵ => ٥
۶ => ٦
۷ => ٧
۸ => ٨
۹ => ٩
While bot tries to make changes in the article, it has to do it in this order: (in UTF-16)
Arabic letter Heh(0648) + Zero width non-joiner(200C) ==> Arabic letter Ae(06D5)
Arabic letter Heh(0648) + . (all kinds) ==> Arabic letter Ae(06D5)
Arabic letter Heh(0648) + ، (all kinds) ==> Arabic letter Ae(06D5)
Arabic letter Heh(0648) + , (all kinds) ==> Arabic letter Ae(06D5)
Arabic letter Heh(0648) + space ==> Arabic letter Ae(06D5)
and then
Arabic letter Heh(0648) + Arabic letter Heh Doachashmee (06BE)
Hi. I've made the requested arabic letter correction at the given pages including the persion numbers. Please have a look at these and approve the behavior. According the utf-16: I am not shure but I guess, python based bots aren't be able to handle that and normally mediawiki also handles utf-8 only. --Xqt (talk) 09:12, 15 January 2010 (UTC)[reply]
Hi Xqt; I checked it and it works good, but has a problem. according to what I said above, some changes should done in a special order, Otherwise it creates wrong words. And about utf-16 & 8, its was just for better understanding of what we need, so I repeat it in utf-8 with some corrections. The bot should do this group of changes first:
Arabic letter Heh(d987)+ Zero width non-joiner(e2808c) ==> Arabic letter Ae(db95) -- example: ە => ە
Arabic letter Heh(d987)+ space ==> Arabic letter Ae(db95)+ space
Arabic letter Heh(d987)+ ، ==> Arabic letter Ae(db95) + ،
Arabic letter Heh(d987)+ . ==> Arabic letter Ae(db95) + .
Then:
Arabic letter Heh(d987) + Arabic letter Heh Doachashmee (dabe)
Hi. It's hard to work and I need your help to confirm the changes. Maybe it would be be the best to contact me at #wikipedia-de-bots or #pywikipediabot.
I tried to understand the first part and I changed the letter-fix to this in this order; guess what ist does ;)
# change و + (one of [‌.،, ]) to ە
# ??? but this space will be lost in this case ??? <<<
text.replace(u'و[.،, ]', u'ە')
# after then, replace remaining و to ھ
text.replace(u'و', u'وھ')
# replace numbers
for i in range(0,10):
text.replace(str(i), u'٠١٢٣٤٥٦٧٨٩'[i])
text.replace(u'۰۱۲۳۴۵۶۷۸۹'[i], u'٠١٢٣٤٥٦٧٨٩'[i])
# replace the rest as requested
text.replace(u'ه', u'ە')
text.replace(u'ه', u'ھ')
text.replace(u'ك', u'ک')
# replace one of [ىي] with ی
text.replace(ur'[ىي]', u'ی')
# new request
text.replace(u',', u'،')
I am shure you will understand this pseudo-code :)
Is that right that و + 2 (space) should become ە ?
If it is done with و replacements it should replace the remaining و with ھ ?
What is about the � stuff. Should this also be implemented ? In this case I don't have a entity like � could you check this?
Dear Xqt, I waited for you in #wikipedia-de-bot, but... . I think this code is improved and summarized:
# change latin comma
text.replace(u',', u'،')
# change in و
# change و + ، to ە + ،
# change و + . to ە + .
# change و + spcae to ە + space
text.replace(u'ه[.، ]', u'ە[.، ]')
# change و + � to ە
# first parametr has two strings, [ و ] and [�]
text.replace(u'ه', u'ە')
# after then, replace remaining و to ھ
text.replace(u'ه', u'ھ')
# replace the K letter
text.replace(u'ك', u'ک')
# replace one of [ىي] with ی
text.replace(ur'[ىي]', u'ی')
# replace numbers
for i in range(0,10):
text.replace(str(i), u'٠١٢٣٤٥٦٧٨٩'[i])
text.replace(u'۰۱۲۳۴۵۶۷۸۹'[i], u'٠١٢٣٤٥٦٧٨٩'[i])
There are two point for consider:
In part 2 of this code, I dont know much about this language, but Im not It would work correctly u'ه[.، ]', u'ە[.، ]' !
In part 3, what should be changed, has 2 letter (one of them can not be seen with this code: � - it exists in many of articles but can not be seen). Regards --Marmzok (talk) 20:10, 15 January 2010 (UTC)[reply]
You has leaved the chat asap ;) The last point was that #x200C-delimiter and I couldn't find the mistake. On our test page I could not see any difference between the origin and the changed word. But I am not so far with arabic letters. Please confirm whether it's good. --Xqt (talk) 16:02, 26 January 2010 (UTC)[reply]
I thought the conversation was over! And about test page, I made a change on that page just like what you said. You said ok, you ran the bot and it did the job very well!(also changed that word on test page). It means every things is alright and now the bot is ready to make mass test changes! (today you said you have't make it ready!!!!!!!!)
I am wondering. I didn't change the behavior since last test edit but you said there is a mistake according the non-joiner. Now the last changes where ok but we must explore in which cases it would'nt work as expected. My idea is to fix all arabic letters on the linked pages of my test page and we check the remaining mistakes if there are any. --Xqt (talk) 10:59, 27 January 2010 (UTC)[reply]
I think Bot should move pages. for example A should move to B and place a redirect in A to B, after this we can manually decide to delete some of them which aren't acceptable (what bot cannot decide about)
Its list of changes:
# replace numbers
:for i in range(0,10):
::text.replace(str(i), u'٠١٢٣٤٥٦٧٨٩'[i])
::text.replace(u'۰۱۲۳۴۵۶۷۸۹'[i], u'٠١٢٣٤٥٦٧٨٩'[i])
#replace the K letter
text.replace(u'ك', u'ک')
# replace one of [ىي] with ی
text.replace(ur'[ىي]', u'ی')
# replace
ه + "_" to ە + "_"
ه + ‌ to ە
ه in end of last word to ە (*)
then
ه to ھ
then
remove all ‌
dear xqt;
we made functions for fixing arabic and persian letters, and set all links exception. But for some reasons it has created some problems, as you know, and I want to ask you to remove this exception. In other words, change it to only [[fa:XXX]] and [[ar:XXX]], transwiki links. For that problem we talked about, being unavailable some of links، its ok.--Marmzok (talk) 16:20, 10 May 2010 (UTC)[reply]
examples given in the chatroom Marmazok 10 May 2010 (UTC)
Disambiguation link notification for March 19
Hi. Thank you for your recent edits. Wikipedia appreciates your help. We noticed though that when you edited Medes, you added links pointing to the disambiguation pages Persian and Aryans (check to confirm | fix with Dab solver). Such links are almost always unintended, since a disambiguation page is merely a list of "Did you mean..." article titles. Read the FAQ • Join us at the DPL WikiProject.