User talk:Stefan Kühn/Check Wikipedia

This is an old version of this page, last edited on 2 January 2009 at 12:02 by Miraceti (new section "False positive detection of error 7"). It may differ significantly from the current version.


<references /> but no <ref>

Your script lists articles with <ref>s but without <references />. Can it also perform the inverse check: <references /> (or, of course, a template like {{reflist}} on enwiki) but no <ref>? Matma Rex answer me on plwiki 14:13, 12 Dec 2008 (CET)

So, what about it? Matma Rex answer me on plwiki 22:23, 19 Dec 2008 (CET)
The problem is that sometimes the <ref> tag is inside a template, and I cannot scan that. -- sk 16:44, 23 Dec 2008 (CET)
I don't think that's so common. And if it is, the detection can be turned off, right? Matma Rex answer me on plwiki 16:03, 25 Dec 2008 (CET)
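
The inverse check requested in this thread could look roughly like the sketch below. It is written in Python rather than the script's Perl, the function name `references_without_ref` is hypothetical, and it only looks at the page's own source. As sk notes, a <ref> that comes from inside a template is invisible to it, so such pages would show up as false positives.

```python
import re

# "<ref>", "<ref name=...>" or "<ref/>", but not "<references />"
REF_TAG = re.compile(r"<ref[\s>/]", re.IGNORECASE)
# "<references/>", "<references />", or the {{reflist}} template mentioned above
REFERENCES_TAG = re.compile(r"<references\s*/?>|\{\{\s*reflist\b", re.IGNORECASE)


def references_without_ref(wikitext: str) -> bool:
    """Return True if the page has a references list but no <ref> in its own source."""
    has_list = REFERENCES_TAG.search(wikitext) is not None
    has_ref = REF_TAG.search(wikitext) is not None
    return has_list and not has_ref


if __name__ == "__main__":
    print(references_without_ref("Some text.\n<references />"))
    print(references_without_ref("Some text.<ref>Source</ref>\n<references />"))
```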

error 036

Error 036 reports a form of broken redirect. Can you expand this feature to detect any illegal character between #REDIRECT and [[? For example, on Japanese Wikipedia many broken redirects contain an IDEOGRAPHIC SPACE (U+3000). --fryed-peach 15:11, 14 Dec 2008 (CET)

Yes, I can expand this. Can you give me an example? Thanks. -- sk 09:13, 19 Dec 2008 (CET)
See an old version of ja:商業放送. The source text is:
#REDIRECT [[民間放送]]

which contains an IDEOGRAPHIC SPACE after #REDIRECT. --fryed-peach 13:43, 19 Dec 2008 (CET)
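
A sketch of the expanded check, in Python rather than the script's Perl. It assumes that only ASCII spaces and tabs are acceptable between #REDIRECT and [[; that rule and the function name `illegal_redirect_chars` are assumptions made for illustration, not the script's actual logic.

```python
import re
import unicodedata

# Capture whatever sits between "#REDIRECT" and the opening "[["
REDIRECT = re.compile(r"^#REDIRECT(.*?)\[\[", re.IGNORECASE)


def illegal_redirect_chars(first_line: str) -> list[str]:
    """Return the Unicode names of unexpected characters before the link target."""
    match = REDIRECT.match(first_line)
    if match is None:
        return []  # not a redirect line at all
    return [
        unicodedata.name(ch, f"U+{ord(ch):04X}")
        for ch in match.group(1)
        if ch not in " \t"  # assumption: ASCII space and tab are fine
    ]


if __name__ == "__main__":
    # "\u3000" is IDEOGRAPHIC SPACE, written as an escape so it stays visible
    print(illegal_redirect_chars("#REDIRECT\u3000[[民間放送]]"))
```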

Difference between error 016 and error 027?

Hello Stefan, I just looked through all the possible errors and have a question out of pure curiosity. I noticed errors 016 (Line very long) and 027 (Long line). Is there a particular reason for distinguishing between 150 characters and 80 characters in a line that begins with a space? Error 027 (80 characters) assumes source code, while 016 (150 characters) assumes an "error", but source-code lines can also be 150 characters long, and errors occur at 80 characters too ;-) Wouldn't it be simpler to merge the two? --PasO 21:41, 14 Dec 2008 (CET)

This is a leftover from two unsuccessful attempts to program this error. Both errors are currently completely deactivated. Maybe I will have time for it later. I wanted to find all lines that have a space in the first position and are then longer than one screen width. Unfortunately I ran into too many problems and dropped it for now. -- sk 09:16, 19 Dec 2008 (CET)
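
For illustration, the merged check PasO suggests could be a single scan with one configurable threshold. This Python sketch is not the deactivated code from the script; the function name `long_indented_lines` and the default limit of 80 are assumptions taken from the numbers mentioned above.

```python
def long_indented_lines(wikitext: str, limit: int = 80) -> list[tuple[int, int]]:
    """Return (line number, length) for lines that start with a space and exceed limit."""
    return [
        (number, len(line))
        for number, line in enumerate(wikitext.splitlines(), start=1)
        if line.startswith(" ") and len(line) > limit
    ]


if __name__ == "__main__":
    sample = "normal line\n " + "x" * 100 + "\n short indented line"
    print(long_indented_lines(sample))
```

Calling it with `limit=150` would reproduce the stricter of the two thresholds without needing a second error number.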

<p> within <blockquote> (English Wikipedia)

Hi Stefan ... On en Wikipedia, <blockquote> HTML is listed as acceptable "wiki markup", and <p>s are necessary within <blockquote> for paragraphing (blank lines won't create paragraphs inside a blockquote). QUESTION: Do you plan to adjust your Perl program to ignore <p>s within <blockquote> ... </blockquote>? (NOTE: I believe <blockquote> formatting can be replaced with {{quote| ... }}, which does not need <p> to paragraph, but as long as <blockquote> is listed as acceptable "wiki markup", well, you get the point. :) Proofreader77 23:33, 15 Dec 2008 (CET)

Can you give me an example, so I can better understand what you mean? I think it is possible to check for <blockquote> ... </blockquote>. -- sk 09:31, 19 Dec 2008 (CET)
Sure ...
  • Go to: en:Wikipedia:WikiProject_Check_Wikipedia#HTML_text_style_element_.3Cp.3E
  • Then notice en:Alcopop table entry: "<blockquote><p>Flavored malt beverages are brewery products that differ from traditional..."
  • NOTE: Within <blockquote>, Wikipedia does not treat blank lines as a new paragraph. If you want a paragraph inside <blockquote>, you must add explicit paragraph codes.
  • As I mentioned, {{quote| ... }} could be used instead of <blockquote> with <p>s, BUT <blockquote> has apparently been on the list of accepted HTML on en.Wikipedia for a long time. :)
Cheers! Proofreader77 20:07, 19 Dec 2008 (CET)

related: <p> within bullet (line beginning with *)

Far rarer, but a simpler one to program (IF line begins with "*" THEN ignore <p>s).

EXAMPLE: From en:Clinical psychology

*'''Existential.''' [[Existential therapy|Existential psychotherapy]] postulates that people are largely free to choose who we are and how we interpret and interact with the world. It intends to help the client find deeper meaning in life and to accept responsibility for living. As such, it addresses fundamental issues of life, such as death, aloneness, and freedom. The therapist emphasizes the client’s ability to be self-aware, freely make choices in the present, establish personal identity and social relationships, create meaning, and cope with the natural anxiety of living.<ref>Van Deurzen, Emmy. (2002). ''Existential Counseling & Psychotherapy in Practice.'' London; Thousand Oaks : Sage Publications. ISBN 0761962239</ref> Important writers in existential therapy include [[Rollo May]], [[Victor Frankl]], [[James Bugental]], and [[Irvin Yalom]].<P>One influential therapy that came out of Existential therapy is [[Gestalt Therapy]], primarily founded by [[Fritz Perls]] in the 1950s. It is well-known for techniques designed to increase various kinds of self-awareness—the best-known perhaps being the "empty chair technique"—which are generally intended to explore resistance to authentic contact, resolve internal conflicts, and help the client complete "unfinished business".<ref name="woldt">Woldt, Ansel and Toman, Sarah. (2005). ''Gestalt Therapy: History, Theory, and Practice''. Thousand Oaks, CA. : Sage Publications. ISBN 0761927913</ref>


In this case you should not use <p>. You should use <br />. This is better because you need no </p>. If you write only <p> without </p>, then you do not have valid XHTML. -- sk 09:30, 19 Dec 2008 (CET)
Yes, <p> without a matching </p> is not "correct" HTML (although many break this rule :), so let us use the code correctly (<p> ... </p>). BUT THEN NOTE: <br /> does not produce a blank line between paragraphs (as usual).
COMPARE: (<p>... </p>)
  • Existential. Existential psychotherapy postulates that people are largely free to choose who we are and how we interpret and interact with the world. It intends to help the client find deeper meaning in life and to accept responsibility for living. As such, it addresses fundamental issues of life, such as death, aloneness, and freedom. The therapist emphasizes the client’s ability to be self-aware, freely make choices in the present, establish personal identity and social relationships, create meaning, and cope with the natural anxiety of living.[1] Important writers in existential therapy include Rollo May, Victor Frankl, James Bugental, and Irvin Yalom.

    One influential therapy that came out of Existential therapy is Gestalt Therapy, primarily founded by Fritz Perls in the 1950s. It is well-known for techniques designed to increase various kinds of self-awareness—the best-known perhaps being the "empty chair technique"—which are generally intended to explore resistance to authentic contact, resolve internal conflicts, and help the client complete "unfinished business".[2]

(<br />)
  • Existential. Existential psychotherapy postulates that people are largely free to choose who we are and how we interpret and interact with the world. It intends to help the client find deeper meaning in life and to accept responsibility for living. As such, it addresses fundamental issues of life, such as death, aloneness, and freedom. The therapist emphasizes the client’s ability to be self-aware, freely make choices in the present, establish personal identity and social relationships, create meaning, and cope with the natural anxiety of living.[3] Important writers in existential therapy include Rollo May, Victor Frankl, James Bugental, and Irvin Yalom.
    One influential therapy that came out of Existential therapy is Gestalt Therapy, primarily founded by Fritz Perls in the 1950s. It is well-known for techniques designed to increase various kinds of self-awareness—the best-known perhaps being the "empty chair technique"—which are generally intended to explore resistance to authentic contact, resolve internal conflicts, and help the client complete "unfinished business".[2]
As I noted, <p> within a bullet line is FAR less common than within <blockquote> ... </blockquote>, but is (apparently?) OK on English Wikipedia. Proofreader77 20:35, 19 Dec 2008 (CET)
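
The two exceptions requested in this thread and the previous one (ignore <p> inside <blockquote> ... </blockquote>, and ignore <p> on lines beginning with "*") can be sketched as follows. This is Python rather than the script's Perl, and the function name `reportable_p_tags` is hypothetical. It is only meant to show that both rules are cheap to apply before reporting.

```python
import re

BLOCKQUOTE = re.compile(r"<blockquote\b.*?</blockquote>", re.IGNORECASE | re.DOTALL)
P_TAG = re.compile(r"</?p\s*>", re.IGNORECASE)


def reportable_p_tags(wikitext: str) -> list[int]:
    """Return line numbers that contain a <p> tag outside blockquotes and bullets."""
    # Blank out each blockquote but keep its newlines so line numbers stay correct
    stripped = BLOCKQUOTE.sub(lambda m: "\n" * m.group(0).count("\n"), wikitext)
    hits = []
    for number, line in enumerate(stripped.splitlines(), start=1):
        if line.startswith("*"):
            continue  # rule from this thread: ignore <p> inside bullet lines
        if P_TAG.search(line):
            hits.append(number)
    return hits


if __name__ == "__main__":
    sample = (
        "<blockquote><p>Quote one</p>\n<p>Quote two</p></blockquote>\n"
        "*Bullet.<P>Second paragraph\n"
        "Plain <p>tagged</p> line"
    )
    print(reportable_p_tags(sample))
```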

Not really a question, just highlighting an example of a page listed in your program's output. Proofreader77 05:13, 16 Dec 2008 (CET)

Highlighting is a feature for the future. :-) --sk 09:32, 19 Dec 2008 (CET)
Cheers! Proofreader77 20:35, 19 Dec 2008 (CET)

error number 30, no image description

Found a false positive on fi.wiki: [[Image:Cis-2-butene.PNG|cis-2-buteeni|frame|right]]. This image HAS a description, "cis-2-buteeni", yet it is listed as not having one. Why is that? Can this be fixed? -- Kimmo Laine / 88.195.95.74 06:55, 19 Dec 2008 (CET)

At the moment my script does not find a description in the middle of the other parameters. Normally we put the description at the end of the image link, for example [[Image:xy.jpg|frame|right|cis-2-buteeni]]. That is easier to read. In the German Wikipedia we had fewer than 20 errors like this. Please write the description at the end. -- sk 09:12, 19 Dec 2008 (CET)
I took a peek at the script, and with my feeble understanding of Perl and regexps, I think you are just testing whether the last parameter of the image call is 'thumb', 'left', 'right' or a pixel value. In that case, you are not checking whether it is 'frame'. Wouldn't an image like [[Image:foo.jpg|frame]] pass as having a description, though it doesn't? That would be a false negative. I don't know if such cases exist, but it's a possibility. -- Kimmo Laine / 62.142.59.83 10:24, 19 Dec 2008 (CET)
You are right. At the moment I check only the last parameter. But we have so many errors that I don't want to blow up the list. :-) "frame" is one case, and a plain image without any parameters is the other ([[image:xy.jpg]]). When the number of errors gets smaller I will add this. -- sk 11:45, 19 Dec 2008 (CET)
I have added "frame" to the script. -- sk 22:46, 20 Dec 2008 (CET)
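
As an alternative to testing only the last parameter, a check could look at every parameter and treat anything that is not a layout keyword or a pixel size as the description. The Python sketch below makes that assumption. The keyword list is illustrative and incomplete, the function name `has_description` is hypothetical, and localized keywords or options such as alt text are not handled.

```python
import re

# Illustrative, incomplete list of layout keywords (assumption, not the script's list)
LAYOUT_KEYWORDS = {
    "thumb", "thumbnail", "frame", "frameless", "border",
    "left", "right", "center", "none",
}
PIXEL_SIZE = re.compile(r"^\d*x?\d+px$")  # 200px, x100px, 200x100px


def has_description(image_link: str) -> bool:
    """Return True if any parameter of [[Image:...]] looks like a description."""
    inner = image_link.strip()[2:-2]  # drop the surrounding [[ and ]]
    params = [p.strip() for p in inner.split("|")[1:]]  # skip the file name
    return any(
        bool(p)
        and p.lower() not in LAYOUT_KEYWORDS
        and not PIXEL_SIZE.match(p.lower())
        for p in params
    )


if __name__ == "__main__":
    print(has_description("[[Image:Cis-2-butene.PNG|cis-2-buteeni|frame|right]]"))
    print(has_description("[[Image:foo.jpg|frame]]"))
```

With this approach the fi.wiki example is accepted regardless of where the description sits, and both [[Image:foo.jpg|frame]] and a bare [[image:xy.jpg]] are reported as lacking one.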

Missing template errors

The section "Template programming element" doesn't discover all errors. It should look for any occurrence of "{{{" in the article namespace, not just "{{{1}}}". --LA2 09:00, 19 Dec 2008 (CET)

I know that these are not all the elements. Can you give me an example? Why not "{{{1}}}"? -- sk 09:08, 19 Dec 2008 (CET)
Look at this syntax correction that I made. Some earlier user had by mistake used subst: on a template instead of just calling the template. As a result, the article included both <noinclude> and {{{width|200}}}. These elements should not appear in articles, but this error was not detected by Check Wikipedia. --LA2 09:26, 20 Dec 2008 (CET)
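
A broader pattern along the lines LA2 describes might match any {{{...}}} parameter as well as stray <noinclude> tags. The following Python sketch is an assumption about what such a check could look like, not the script's code; the function name `template_residue` is hypothetical.

```python
import re

# Any {{{name}}} or {{{name|default}}}, plus opening/closing <noinclude> tags
TEMPLATE_RESIDUE = re.compile(
    r"\{\{\{[^{}]*\}\}\}|</?noinclude\s*/?>",
    re.IGNORECASE,
)


def template_residue(wikitext: str) -> list[str]:
    """Return every template-programming fragment found in article text."""
    return TEMPLATE_RESIDUE.findall(wikitext)


if __name__ == "__main__":
    print(template_residue("width={{{width|200}}} <noinclude>doc</noinclude>"))
```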

Defaultsort problem

How can these be fixed?

82.171.148.8 18:29, 20 Dec 2008 (CET)

OK, I have fixed this in the script. -- sk 11:03, 29 Dec 2008 (CET)

Translation

Hi! Thanks for the statistics! I have a question: do we really need the lines matching error_\d\d\d_(prio|head|desc)_script in our translation? It seems they are ignored and only lines matching error_\d\d\d_(prio|head|desc)_..wiki are used. We had error_033_prio_script=0 END in the translation by accident, but the test was run anyway. I do not know whether it ran because we also had error_033_prio_cswiki=3 END.[1] Miraceti 09:10, 23 Dec 2008 (CET)

All parameters ending in ..._script are for information only. They show the value that I set in the script. If you want to change the behaviour on cswiki, please change the parameter ..._cswiki. If you set error_033_prio_script=0 END, nothing will happen. If you set error_033_prio_cswiki=-1 END, error 33 will be set to "deactivated". -- sk 09:24, 23 Dec 2008 (CET)
So it is as I supposed. We can take them out. Thanks.
Are you sure that error_033_prio_cswiki=-1 END causes the error's deactivation? I thought it means "unknown" -> use the default _prio_script value. The deactivation should be done by error_033_prio_cswiki=0 END, shouldn't it? Miraceti 12:25, 23 Dec 2008 (CET)
Yes, that is right. "0" means deactivated. -- sk 13:03, 23 Dec 2008 (CET)
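
The resolution rule as it reads at the end of this exchange can be sketched in Python. The sketch assumes Miraceti's interpretation holds: a wiki-specific value of -1 (or a missing one) falls back to the script default, and 0 means deactivated. The function names `parse_priorities` and `effective_priority` are hypothetical, and the real script may differ.

```python
import re

# Matches lines such as "error_033_prio_cswiki=3 END"
PRIO_LINE = re.compile(r"^\s*(error_\d{3}_prio_\w+?)\s*=\s*(-?\d+)\s+END\s*$")


def parse_priorities(text: str) -> dict[str, int]:
    """Collect all error_NNN_prio_* settings from a translation page."""
    settings = {}
    for line in text.splitlines():
        match = PRIO_LINE.match(line)
        if match:
            settings[match.group(1)] = int(match.group(2))
    return settings


def effective_priority(settings: dict[str, int], error_no: int, wiki: str) -> int:
    """Wiki value wins; -1 or missing falls back to the script default (assumption)."""
    local = settings.get(f"error_{error_no:03d}_prio_{wiki}", -1)
    if local == -1:
        return settings.get(f"error_{error_no:03d}_prio_script", 0)
    return local  # 0 means the error is deactivated on this wiki


if __name__ == "__main__":
    page = "error_033_prio_script=0 END\nerror_033_prio_cswiki=3 END"
    print(effective_priority(parse_priorities(page), 33, "cswiki"))
```

Under this rule the accidental error_033_prio_script=0 END has no effect as long as error_033_prio_cswiki=3 END is present, which matches what Miraceti observed.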

Hi, it would be useful if your new «statistics» section provided links for errors that have at least one detection, so that users can jump directly to the sections of interest to them. Instead of:

| 6 || description_of_error_6 || middle || unkown || style="text-align:right;" | || style="text-align:right;" | 1 || ↗ || style="text-align:right;" | 1

use

| 6 || [[#description_of_error_6|description_of_error_6]] || middle || unkown || style="text-align:right;" | || style="text-align:right;" | 1 || ↗ || style="text-align:right;" | 1

Errors having 0 detections do not have sections on the page, so better not put a link for those errors, if possible.

Thanks --66.131.214.76 01:15, 24 Dec 2008 (CET)

I will try this. -- sk 09:18, 27 Dec 2008 (CET)
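
The suggestion amounts to a small conditional when the statistics row is written. A Python sketch, with the hypothetical helper name `description_cell`:

```python
def description_cell(description: str, detections: int) -> str:
    """Link to the error's section only when that section exists on the page."""
    if detections > 0:
        return f"[[#{description}|{description}]]"
    return description  # no section is generated for errors with 0 detections


if __name__ == "__main__":
    print(description_cell("description_of_error_6", 1))
    print(description_cell("description_of_error_7", 0))
```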

False positive in error 037 : articles missing DEFAULTSORT

An article like S/2003 J 10 is wrongly reported in section 037. The symbol «/» should not cause a report.

The comment under #Defaultsort problem above also indicates an incorrect report in section 006 (DEFAULTSORT with special token) on NL Wikipedia.

--66.131.214.76 03:11, 24 Dec 2008 (CET)

I will change this. -- sk 09:19, 27 Dec 2008 (CET)

<center>

Can you add detection of the <center> tag? --fryed-peach 03:29, 25 Dec 2008 (CET)

Yes, I can add this. -- sk 09:18, 27 Dec 2008 (CET)

Titles with special characters and without DEFAULTSORT

This section also lists terms containing "/", such as TP/IX. What DEFAULTSORT value should those get?--JFKCom 01:03, 28 Dec 2008 (CET)

"/" will be taken out of the script. See also further above. -- sk 09:16, 28 Dec 2008 (CET)
Ah, now I see it further above too. Thanks for the quick answer!--JFKCom 09:44, 28 Dec 2008 (CET)

Hello Stefan, how can I get C++ off the error list? Neither this nor this helped. --UV 22:28, 28 Dec 2008 (CET)

I think it should work with {{DEFAULTSORT:C}}. Please try again. Over the last day my script did not run on the toolserver; maybe that was the reason. Normally it should work with DEFAULTSORT. -- sk 11:03, 29 Dec 2008 (CET)

error number 37 section on ca:wikipedia

Hi, there is a small error in the error number 37 section on Catalan Wikipedia: cawiki_output_for_wikipedia.txt contains the following sentence without the closing </nowiki> tag.

El títol té algun caràcter especial i a l'article no hi ha ORDENA o DEFAULTSORT. O bé alguna categoria no té la sintaxi <nowiki>[[Categoria:ABC|Text]]</nowiki>. Regards. --Loupeter 08:44, 29 Dec 2008 (CET)

This was an error in the cawiki translation page. See my change here. -- sk 10:51, 29 Dec 2008 (CET)

Nº 10 false error

Hi, Stefan. First of all, thanks for this really, really useful tool, and excuse my bad English. I'm a user on es:wiki, and I would like to draw attention to an error I've sometimes seen in item number 10, "not having the same amount of [[ and ]]". You can see it at es:Cantabria and es:Homosexualidad en los Estados Unidos. Can this be fixed? Ilfirin 20:49, 29 Dec 2008 (CET)

Hello Ilfirin, I am working on it, but it is very difficult. The problem is a line break inside an image's ref or legend. If you delete the line break, my script has no problem. -- sk 21:03, 29 Dec 2008 (CET)
Thanks for the quick answer. What about counting how many [[ and ]] there are when the script finds an error? Or keeping "File"/"Image" (on es:wiki "Archivo"/"Imagen") out of this error list? Or maybe both at the same time: verifying that the images have the correct number of [[ and ]] independently of the line breaks. Greetings, Ilfirin 22:59, 30 Dec 2008 (CET)
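
Ilfirin's idea of checking the brackets independently of line breaks can be sketched as a scan over the whole page text instead of line by line. This Python sketch is illustrative only; the function name `unbalanced_links` is hypothetical, and it does not try to handle edge cases such as an external link that ends directly before a closing ]].

```python
import re

TOKEN = re.compile(r"\[\[|\]\]")


def unbalanced_links(wikitext: str) -> tuple[list[int], list[int]]:
    """Return offsets of unclosed '[[' and of stray ']]', ignoring line breaks."""
    open_positions = []   # stack of '[[' offsets still waiting for a ']]'
    stray_closers = []    # ']]' offsets with no matching '[['
    for match in TOKEN.finditer(wikitext):
        if match.group() == "[[":
            open_positions.append(match.start())
        elif open_positions:
            open_positions.pop()
        else:
            stray_closers.append(match.start())
    return open_positions, stray_closers


if __name__ == "__main__":
    legend = "[[Archivo:Mapa.png|thumb|Leyenda con\nsalto de línea y [[enlace]] interno]]"
    print(unbalanced_links(legend))
```

Because the scan never splits on newlines, an image legend that wraps over several lines balances correctly, while a genuinely missing bracket still leaves an offset in one of the two lists.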

The correct/not correct table

Hi Stefan, can you go to it:Discussioni Wikipedia:Elenchi generati offline/Check Wikipedia and complete the table? Or can you create a subpage /Errors and solutions with something like that table? Thanks a lot and happy new year! --Red Power 15:51, 31 Dec 2008 (CET)

False positive detection of error 7

Error 7 was repeatedly detected in cs:Metro v Oslu. The article does not contain it. Please fix the bug. Miraceti 11:02, 2 Jan 2009 (CET)

  1. Van Deurzen, Emmy. (2002). Existential Counseling & Psychotherapy in Practice. London; Thousand Oaks : Sage Publications. ISBN 0761962239
  2. a b Woldt, Ansel and Toman, Sarah. (2005). Gestalt Therapy: History, Theory, and Practice. Thousand Oaks, CA. : Sage Publications. ISBN 0761927913
  3. Van Deurzen, Emmy. (2002). Existential Counseling & Psychotherapy in Practice. London; Thousand Oaks : Sage Publications. ISBN 0761962239