MediaWiki talk:Community Portal/Damage Control Central
More discussion and information about this can be found on these Allspark.com threads:
- "TFWiki Problems, Any idea what's going on?"
- "Major Wiki Work, A thread for additions to Transformers wikis."
Wikia links
Is there no way to strip these out other than editing them out? Given that editing leaves them in the history, that doesn't seem desirable. - SanityOrMadness 08:47, 18 March 2009 (EDT)
- The history doesn't matter as much, since the search engines don't index it, and people generally only look at it if they're editing. --203.97.2.142 15:40, 18 March 2009 (EDT)
- Derik told me he was able to purge all the Wikia links from the Wikia database dump (some half a million of them), including all those existing in revisions, by editing the import script so they never got into TF Wiki's database, but since we reverted to an even-earlier version of Wikia's database... are we SURE the Wikia links in the history don't matter? I would rather be safe than sorry. --FFN 23:25, 18 March 2009 (EDT)
- All history pages have a robots tag with "noindex, nofollow" set. That means that the search engines won't add those pages to their indexes and won't credit Wikia with the links from them. --abates 00:43, 19 March 2009 (EDT)
- I have a list of all the non-Main pages in my sandbox. Derik just needs to feed it to Deceptitran, and let him loose. --FortMax 14:02, 19 March 2009 (EDT)
Thousands of pages still have Wikia links, mostly the redirects and the talk pages, but also some other pages that may or may not have been fully restored yet. I decided to try to edit them out a few at a time, but it seems that this slow process will take forever (practically). Do you gentle-peeps have a bot that will rip them out or replace them automatically? Or, perhaps, will these pages soon be replaced by backups? C.V. Reynolds 07:38, 20 March 2009 (EDT)
- Oops! Seems that FortMax was several steps ahead of me, there. My hatred for the wikia has made me temporarily blind to some things. Make sure to catch the redirects if they show up on Google, too, as well as the image pages. C.V. Reynolds 07:42, 20 March 2009 (EDT)
One big zip
Would it be possible to have made available a big zip of the stuff recovered from Google's cache? I'm thinking it might come in handy. --abates 01:39, 19 March 2009 (EDT)
Dealing with pages that haven't converted
It looks like some of the pages haven't been converted by the script. Am thinking that the best way to deal with them may be to hit "undo", slap a bookworm tag on the reverted page, and then we'll have a category with the pages which need reconverting. That way we're not leaving too many ugly pages on the Wiki. Does that sound like a feasible plan? --abates 07:23, 19 March 2009 (EDT)
- I was like "WHAT THE HELL" during the transfer of pages. I noticed Weird Al's article is simply titled "\" like somebody saved the cache under the wrong filename. What? --FFN 08:04, 19 March 2009 (EDT)
- The script - in titles, at least - choked partially at a single quotation mark (', sometimes used in place of an apostrophe) by sticking an escape character (the backslash) in front of it, and COMPLETELY at a double quotation mark ("), putting the backslash in then stopping. - SanityOrMadness 08:46, 19 March 2009 (EDT)
- Are Cyclonus (G1) and Scourge (G1) examples of "pages not converting"? Are they (and others like them) going to get automatically redone or should we do as Abates suggested? Thylacine 2000--74.73.131.210 10:04, 19 March 2009 (EDT)
- Those are good examples. Hence my earlier request for a zip file containing all of the downloaded cache files. We don't even know at this stage how many articles failed to convert, but if I have to redo every one of them manually, I will. --abates 14:42, 19 March 2009 (EDT)
Looks like templates, faction symbols, and reference notes are all borked. Can we try that again, or are we looking at 8000 manual fixes? -- Repowers 10:31, 19 March 2009 (EDT)
It looks like User:Maintenance script is interpreting the < and > signs as literal ones we want on the page (converting them into &lt; and &gt;), rather than as markers for HTML tags. Thus why we've got huge messes of HTML on pages. --Andrusi 11:36, 19 March 2009 (EDT)
- Yeah, doing a find-and-replace in Notepad (and then adding in one </p> tag that had disappeared somehow) was enough to make Reggie Simmons at least look right. --Andrusi 11:40, 19 March 2009 (EDT)
- Could we get Deceptitran to go across the site converting them back to < and >? It'd help a lot, plus it'd make the broken pages look better until they're fixed. - Magnus Maximus 23:54, 19 March 2009 (EDT)
- In the meantime, there's a lot of weird code appearing where there should be templates; Template:Comingsoontoy and Template:Picsneeded are two of the biggest offenders I've seen, as they make a mess on every page they appear on. Not to mention what was once Template:Factions, which was lost in the crash and has left a small amount of excess code at the top of many of our pages. --Martonimos 06:28, 20 March 2009 (EDT)
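A minimal sketch of the find-and-replace described above, written in Python. It assumes the escaped tags show up in the page text as the entities &lt; and &gt;; the function name is made up for illustration, and this is not what Deceptitran or the import script actually runs.

```python
def unescape_tags(wikitext: str) -> str:
    """Turn escaped angle brackets back into literal ones.

    Only &lt; and &gt; are touched; other entities such as &amp; are left
    alone so that legitimately escaped text is not changed by accident.
    """
    return wikitext.replace("&lt;", "<").replace("&gt;", ">")


if __name__ == "__main__":
    sample = "&lt;p&gt;Some article text.&lt;/p&gt;"
    print(unescape_tags(sample))  # prints <p>Some article text.</p>
```

This only undoes the escaping. A tag that went missing entirely, like the </p> mentioned above, would still need fixing by hand.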
Sitewide notice
Are we able to put up a small sitewide notice like the gun-to-head one on Wikia with a note re what's going on, so that when people browse in from search engines, they can get an idea of what's going on? --abates 14:51, 19 March 2009 (EDT)
- What you want is MediaWiki:Sitenotice. The admins should be able to do that. -- SFH 16:46, 19 March 2009 (EDT)
Things that are fully restored
- The Calendar now has 358 out of 365 days accounted for. I don't think we lost more than 2 pages total-- we never HAD all 365 days in the first place. 100% complete, no further work needed. -Derik 16:15, 19 March 2009 (EDT)
- THAT is a relief. --Lonegamer78 00:29, 20 March 2009 (EDT)
- Category:Robots in Disguise characters all done. --abates 04:10, 20 March 2009 (EDT)
Images
Most of the images uploaded since the date the wiki reverted to have been lost. Any way to recover these, the way we have the rest of the pages, or do we have to track them down and re-upload them ourselves? --Martonimos 16:23, 19 March 2009 (EDT)
- They all exist on the server, even the thumbnails (as you can see if you look at the google cache of any page which was last cached pre-crash. And had images to start with, obviously) - MediaWiki just doesn't know they're there. Ultimately, they may need to be reuploaded, but they won't have been lost. - SanityOrMadness 16:54, 19 March 2009 (EDT)
- I see. So we'll eventually import those the way we're importing everything else? --Martonimos 21:31, 19 March 2009 (EDT)
Dividing up work
It was suggested on the AllSpark pages that people do pages by category rather than alphabetically, since they'll share common templates. Category:Disambiguation pages have a fairly consistent structure, for instance. Perhaps if people "claim" sets of pages and make a note here when they do so? Cartoons could be split by season, comics split in 50 issue increments, etc. --abates 16:29, 19 March 2009 (EDT)
- As another note that may help - templates which appear borked on the recovered cache pages may already exist from the previous good version of the page, so copying them from there might be easier. --abates 16:46, 19 March 2009 (EDT)
- I'm not sure how I feel about that. On the one hand, it does have the advantage of consistency in how the pages need to be re-formatted.
- On the other hand, it also means there's a chance of missing some pages only in obscure categories, or redundant work (since most pages are in more than one category).
- I think if we do it by category, we need at least a few people willing to go through alphabetically as well checking things (which I'm currently doing). --Jeysie 16:55, 19 March 2009 (EDT)
- I'm planning on doing all of the episode pages (MORE THAN SIX HUNDRED GRAAAAHHH) so they have consistent formatting. So, uh, you guys don't have to worry about them. —Interrobang 17:03, 19 March 2009 (EDT)
- I was going to claim the RID episode pages, but if you're planning on doing them anyway, go ahead. I'll take the RID character pages. I think once we've done most of the pages by category, it'll just be a matter of using the search box to search for HTML fragments, but we'll see. --abates 17:34, 19 March 2009 (EDT)
New Editing Help Box
Is there any way one of the admins could take a little time to make up a new box with editing shortcuts? A list of common templates would be especially helpful to everyone, I think. --Jeysie 17:04, 19 March 2009 (EDT)
- I agree, I don't know how to do certain things like the tiny faction symbol at the top of the character pages, or making series navigation boxes for cartoons or comics. --Crockalley 07:16, 20 March 2009 (EDT)
Front page notice and browser caches
Jeysie, is there some reason why you changed the front page notice from asking people to help us with google caches to helping us clean up pages? From this page's very own Finish Summary, we've not finished with the google caches yet, as we evidently need:
0400-8599 - Needed
So are we done with caches or not?
Also, this question was largely ignored by everybody, but some fans of the wiki had offered to help out with their browser caches. This presumably means their caches are more up-to-date than most of the pages people have uploaded. What should I tell them to do? --FFN 17:50, 19 March 2009 (EDT)
- I had thought we already saved and uploaded all of the Main article cache pages we could? I know we didn't save all of the Talk pages yet, but I'd personally rather have any editors just coming in set to work on getting our existing important articles cleaned up instead of making a concentrated effort to worry about the Talk pages. If I was wrong, you can change it back, though. --Jeysie 17:54, 19 March 2009 (EDT)
- "0400-8599 - Needed"
- #Finish Summary = Huh? Not what I'm seeing. - SanityOrMadness 17:59, 19 March 2009 (EDT)
- The 0400-8599 are for talk pages, which are obviously a lower priority, but still part of our whole "character." I believe either Derik's or Suki's scripts also went through to grab those as well so they may be partially covered, but manual grabbing would be nice. Bluestreak7 18:26, 19 March 2009 (EDT)
- Thank you. Now, does anybody have an answer to my second question? I feel bad just telling people to keep "holding onto those browser caches". --FFN 18:31, 19 March 2009 (EDT)
- Oh, almost forgot. Have we uploaded all the pages we saved yet? I noticed our Derrick J. Wyatt article is from May 2008. Our google cache is from Feb 26, 2009. --FFN 18:46, 19 March 2009 (EDT)
ROTF Devastator article
Looks pretty bad. Does anyone wanna clean it up or should I? Godziboy1993 18:18, 19 March 2009 (EDT)
- Fixed! --abates 19:14, 19 March 2009 (EDT)
Accounts
Hi, I asked this on the Facebook page but no-one responded. Do I need to create a new account, or will our old accounts eventually be restored? This is User:MistaTee by the way.--68.98.163.92 19:56, 19 March 2009 (EDT)
- I say go ahead and create a new one. When I created one using my old user name, it even went and associated my User and Talk pages with my account, so I didn't lose anything. -- Semysane 20:29, 19 March 2009 (EDT)
- Yeah, it totally remembers who we are; it's just being coy. I tried to re-register with a new password, and it told me I needed to use the old password instead. Be aware, though: Your re-created User page and its Talk page will both have the Wikia-link at the bottom, so you'll need to trim those. Also, my Talk didn't have all the latest conversations, so I needed to go to Google Cache for that. - Jackpot 01:39, 20 March 2009 (EDT)
- Not all of us, only those of us who were around before the reset date. The site doesn't remember a thing about me. --Martonimos 06:26, 20 March 2009 (EDT)
Identifying Templates
I think we need a spot where people can help other people out by, well, identifying templates.
Like right now I'm staring at the 1984 page trying to figure out whether the nav at the top was hand-coded or is supposed to be a template. --Jeysie 22:56, 19 March 2009 (EDT)
- That was fairly obviously a template to look at. A glance at That Other Place showed it was Template:Yearofthe, which I "stole" back. - SanityOrMadness 23:10, 19 March 2009 (EDT)
- Ah, there we go. I had that template in my saved cache pages, and thought the title looked promising, but... there was no info on the cache page, and the template didn't exist here, so... eheh, I was stumped. Thanks! --Jeysie 23:30, 19 March 2009 (EDT)
- I know the infoboxes and some others won't work because of the variable fields, but can we maybe set Deceptitran to restore the static templates, like the (empty) stubs, comingsoontoy, noname etc. where they're always the same no matter what page they're on? It should just be a simple find/replace code function. - Magnus Maximus 23:59, 19 March 2009 (EDT)
The year pages are so undeveloped. I feel like they haven't quite found their purpose. (Which is probably why we didn't have a 2009 page even up until last week.) -Derik 23:14, 19 March 2009 (EDT)
Do we have an episode or comicissue template AT ALL right now?
- Episode, yes. Comic, no. - Magnus Maximus 00:25, 20 March 2009 (EDT)
My advice... just go ahead and take the chance to update everything to the comicstory template, and wait for someone to get a chance to restore it. (It exists, it's just sorta... bleh at the moment.)
Here's the code:
{{Comicstory|
|title=
|seriesissue=
|prev=
|next=
|seriesissue2=
|prev2=
|next2=
|image=
|caption=
|publisher=
|date=
|coverdate=
|script=
|pencils=
|inks=
|colors=
|letters=
|editor=
|continuity=
}}
Everything between "coverdate" and "continuity" should be changed to match the credits style in the book itself. --Jeysie 01:07, 20 March 2009 (EDT)
- The comic template is Template:Comicstory, for those unaware. —Interrobang 01:23, 20 March 2009 (EDT)
I thought I knew most of the templates, but I just discovered embedding .ogg files is done using Template:listen. Category:Templates is useful to have bookmarked. :) --abates 04:05, 20 March 2009 (EDT)
Does anyone remember what the syntax for the Wookiepedia template was? (Looks like it was too new to get Google cache saved.) --Jeysie 04:51, 20 March 2009 (EDT)
Also, can anyone remember what all the right parameters are for the series part of the Episode template? I'm trying to restore a Victory episode page, and I can't get the series name to show up right. --Jeysie 06:57, 20 March 2009 (EDT)
Unrecovered pages
Is there a comprehensive list of pages that didn't get updated by the script? I just bumped into Armada Overload's page by chance, and knew because I'd worked on it that it was out of date, but... that's about the worst possible way to find such pages. -- Repowers 23:20, 19 March 2009 (EDT)
- Hrm, did it still have the Wikia spam at the bottom? -Derik 00:15, 20 March 2009 (EDT)
- Because they don't have Categories, you can find a list at Uncategorized Pages. I'll be going through those over the weekend as a priority. --abates 00:31, 20 March 2009 (EDT)
- Oh, wait, wrong problem. Those pages still have the Wikia spam at the bottom, so we should be able to search for them. --abates 01:02, 20 March 2009 (EDT)
- We need to run a script again, then, 'cause it's well over 4,000 pages. -- Repowers 01:15, 20 March 2009 (EDT)
- Dang, a lot of those are accounted for by pages which have moved since last June. Or were uploaded with the wrong name because the title had a ' in it. And quite a few of them are redirects (that damn Wikia spam gets everywhere). --abates 01:44, 20 March 2009 (EDT)
- Not just ', but periods too. All articles starting with "S." got uploaded into S, as you can see from the history. Ditto for Dr. —Interrobang 01:51, 20 March 2009 (EDT)
- I wonder if there's an easy way to get a list of articles where that's happened. I guess that will have happened with the articles starting "G.I. Joe" as well. --abates 07:29, 20 March 2009 (EDT)
- A lot of these are redirects, it turns out. Dozens and dozens of (UT) redirects, among others. God I hate those things! I thought I'd stamped them out forever! Hopefully we can turn a bot loose on them to delete them? -- Repowers 09:10, 20 March 2009 (EDT)
I think a big problem is all the pages with slashes and periods in their names... it would help immensely were we able to actually sort those out automatically. It's possible to just go through those ones, and use the same files, but pass a --title parameter to the import script so it gets it right... but unfortunately, that's beyond my shell scripting abilities at the moment. Also: abates, here are the cached pages you requested: [1] is the preprocessed set, and [2] is the processed set. --Suki Brits 01:54, 20 March 2009 (EDT)
- Actually, maybe it's simpler than I thought. I'll give it a shot tomorrow, unless anyone has any suggestions. --Suki Brits 02:01, 20 March 2009 (EDT)
- Thanks very much for that! I'll start restoring some of the articles which didn't convert properly. --abates 04:24, 20 March 2009 (EDT)
- I was able to spot what the problem was with the articles which didn't convert, and the HTML to Wiki tool on my site has been updated to cope with them. --abates 05:30, 20 March 2009 (EDT)
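On the question above of how to list the articles whose titles got mangled: a rough sketch in Python, assuming a plain list of expected page titles is available. The character set comes only from the problems reported on this page (single and double quotation marks, periods, slashes); the function and constant names are hypothetical and nothing here is part of the actual import script.

```python
# Characters reported on this page as breaking titles during the import.
RISKY_CHARS = ("'", '"', ".", "/")


def risky_titles(titles):
    """Return the titles containing any character reported to break the import."""
    return [t for t in titles if any(ch in t for ch in RISKY_CHARS)]


if __name__ == "__main__":
    expected = [
        "Cyclonus (G1)",
        "Stalker (G.I. Joe)",
        "A Battle... and Then...",
    ]
    for title in risky_titles(expected):
        print(title)
```

The output is only a list of candidates to check by hand or to re-import with an explicit title; it does not tell you where a given page actually ended up.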
Oh gods. Somebody please tell me I don't have to restore all of More Than Meets The Eye's content and its 300+ links by hand. -- Repowers 08:45, 23 March 2009 (EDT)
Misconverted Pages Discovered
The converted code for A Battle... and Then... got shunted into A Battle instead. (I don't want to fix it myself because my browser doesn't display the Japanese characters right, so I might lose them.) --Jeysie 05:21, 20 March 2009 (EDT)
- I've shifted the code over to the right place. --abates 05:30, 20 March 2009 (EDT)
Busted-arse pages
How come Armada Starscream's page is truncated? Was there something wrong with the cache we used? The Google cache has since changed to our post-crash Wikia version. --FFN 09:44, 20 March 2009 (EDT)
- The cache file in Suki's zip is fine. It's on the list of articles I have which didn't import properly anyway, so it'll be fixed in due time. --abates 17:22, 20 March 2009 (EDT)
Altogether missing pages
Apparently Optimus Primal toys fell through the cracks. http://74.125.47.132/search?q=cache:45MZwyVRoFkJ:tfwiki.net/wiki/Optimus_Primal_toys+TFWiki+Optimus+Primal+toys&cd=1&hl=en&ct=clnk&gl=us --Thylacine 2000 10:02, 20 March 2009 (EDT)
- Optimus Prime (Animated) did too - I can't find it in the big bunch of cache files from Suki, and it's gone from Google's cache. --abates 18:04, 20 March 2009 (EDT)
- I just updated AniPrime as best I can from the Yahoo cache, which actually seems to have been up-to-date, albeit ugly. Thank you kindly for resurrecting Primal! --Thylacine 2000 11:39, 21 March 2009 (EDT)
- All Omega Supreme pages are missing from the big cache files, too. That's just great. --FFN 11:22, 21 March 2009 (EDT)
- Both still on Googlecache, thank goodness:
http://74.125.47.132/search?q=cache:z2NCqQWjR2kJ:tfwiki.net/wiki/Omega_Supreme_(G1)+TFWiki+Omega+Supreme&cd=1&hl=en&ct=clnk&gl=us http://74.125.47.132/search?q=cache:36HJ3soPZzsJ:tfwiki.net/wiki/Omega_Supreme_(Animated)+TFWiki+Omega+Supreme&cd=2&hl=en&ct=clnk&gl=us --Thylacine 2000 11:43, 21 March 2009 (EDT)
- Okay, so, AniSupreme has already been restored by some kind soul. I'm trying G1Supreme, but... in order for me to locate the Google cache page, it highlights the search terms, and then when I save that as HTML and convert it through Abates' script, there are shittons of wiki-fied name color instructions all over everything. And also the whole page is just one teeny column wide too. What am I doing wrong? --Thylacine 2000 11:57, 21 March 2009 (EDT)
- Don't worry, I just spent the past 40 minutes fixing G1 Omega Supreme's article. Unfortunately Energon Omega's Google cache has been updated to March 16th Wikia Style, though I suspect not much work was done on that article anyway. Who the hell forgot to backup the Omega articles, anyway?
- BTW, I *knew* in my gut that we hadn't finished backing up yet, which is why I questioned Jeysie's decision to change the front page message from asking for Google cache help to simply cleaning up. --FFN 12:09, 21 March 2009 (EDT)
- Got Energon Omega Supreme's cached page via MSN, although the last mod date is November 18, 2008. --Lonegamer78 16:52, 21 March 2009 (EDT)
- The O articles were all uploaded by G1MarvelBlaster to File Qube, so many of them can be recovered from there as per the Finish Summary. I'm guessing that the files weren't grabbed off there before the big update? --abates 00:52, 22 March 2009 (EDT)
- Orbital bounce is waaaay out of date, too. Sounds like we missed some O's? -- Repowers 17:13, 21 March 2009 (EDT)
- Definitely missed Overlord! Luckily Google still has a current cache reflecting all the work McFeely did on Masterforce: http://74.125.47.132/search?q=cache:bv8tsuZr-rgJ:tfwiki.net/wiki/Overlord_(Masterforce)+TFWiki+Overlord&cd=1&hl=en&ct=clnk&gl=us Can someone help? As I mentioned above, every time I try to grab a cache I end up saving a page full of color-cues for my search terms. --Thylacine 2000 18:28, 21 March 2009 (EDT)
- Get rid of anything after the url of the article. http://74.125.47.132/search?q=cache:bv8tsuZr-rgJ:tfwiki.net/wiki/Overlord_(Masterforce) 18:39, 21 March 2009 (EDT)
- As a side note, just so people know, I actually DO have backups of (I believe) all my Masterforce and Victory character articles, and the pictures that go with them, on my hard drive. The hard drive inside the laptop that died last week. So, there is no GREAT need to rush to recover google caches on those, if you have better things to be doing. I haven't checked them all myself yet to see if there are any that have been destructed. The info is all there... I just... can't... get to it right now. - Chris McFeely 18:46, 21 March 2009 (EDT)
- (*BOOM*) "Komoribreast" do not exist anymore!--Thylacine 2000 00:27, 22 March 2009 (EDT)
- That's because it was a redirect to Kōmoribreast, I think! --abates 00:55, 22 March 2009 (EDT)
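The advice above about cutting everything after the article URL can be sketched in Python as follows. It assumes the cache links have the shape of the ones posted in this section, with the search terms appended after a "+" and further parameters after "&", and that the article title itself contains no literal "+" or "&". The function name is made up for illustration.

```python
def trim_cache_url(url: str) -> str:
    """Drop the '+search+terms' and '&...' parameters from a Google cache URL.

    Without the search terms, the cached page should come back without the
    highlighting markup that was ending up in the saved HTML.
    """
    # Search terms follow the first '+'; extra parameters follow the first '&'.
    for separator in ("+", "&"):
        url = url.split(separator, 1)[0]
    return url


if __name__ == "__main__":
    full = (
        "http://74.125.47.132/search?q=cache:bv8tsuZr-rgJ:"
        "tfwiki.net/wiki/Overlord_(Masterforce)+TFWiki+Overlord"
        "&cd=1&hl=en&ct=clnk&gl=us"
    )
    print(trim_cache_url(full))
```

Run on the Overlord link above, this gives the same shortened URL that was posted in the reply.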
HTML to Wiki converter
I think Derik mentioned one he did as well, but here's my HTML to WikiCode converter. --abates 02:46, 20 March 2009 (EDT)
- Thankee, Sai. (I know I had a comic page I wasn't looking forward to hand-rearranging.) --Jeysie 03:26, 20 March 2009 (EDT)
- Is that the one used to filter the incoming page by the import script? -Derik 03:29, 20 March 2009 (EDT)
- Yes, though there are a few improvements made since the import, such as handling Wikipedia interwiki links. --abates 03:57, 20 March 2009 (EDT)
- What should us laypeople do about the misconverted html pages for now? It's pretty embarrassing to have our front page link to an article that is essentially unreadable (Defiance issue 2). --FFN 08:08, 20 March 2009 (EDT)
- Fixed that page. Basically, just download the big tar.gz of "Before" pages Scout linked to earlier in this page (7-Zip can open just about everything, if you need a program to open it with), then run the needed page through Abates' converter manually. --Jeysie 08:31, 20 March 2009 (EDT)
- That 52 meg file? Yikes. Too big for me. --FFN 09:27, 20 March 2009 (EDT)
- I don't know if this is the best course of action, but if you just want individual pages, they're still here: [3] --Suki Brits 12:26, 20 March 2009 (EDT)
- Is there some reason why some of the cached files on the server aren't saved in html but as text files, thus rendering Abates' HTML converter useless? I see the ones marked by a question mark cannot be converted. --FFN 10:47, 21 March 2009 (EDT)
- They're all HTML files, regardless of extension. --Suki Brits 11:55, 22 March 2009 (EDT)
Found A Temp Fix
I don't know how long it will last, but I tried to fix Thundercracker, and well, it worked. (Look here: http://tfwiki.net/wiki/Thundercracker_(Armada) ) All I did was go to [EDIT: the Wikia site] and copy over the new stuff with the old. Don't know how long it will last tho. Vipes 11:54, 20 March 2009 (EDT)
- This could only possibly work on the smallest, lamest, most forgotten articles that we had here--something that had not been altered at all since June. As it happens, clicking on "compare versions" showed that we had made some small additions to Armada Thundercracker that now, in its Wikia state, are gone. Could you please go through it and reinsert the paragraph(s)? (I already closed the window, or else I'd have done it myself).
- As a rule, we want our content to be DIFFERENT from theirs, for Google rankings and other important reasons; if, hypothetically, a person were to port some old Wikia page over here just to get working wiki software tags, they would then still have to dig through our own March-era files and/or cache and reinsert the modern text, one by one. If anything, that would probably be the slowest procedure available to us. --Thylacine 2000 12:14, 20 March 2009 (EDT)
Can we clean house on this page?
This page has gotten pretty out of control. I have no idea what's now relevant and what's obsolete; it's totally incomprehensible for newcomers.
At this point, the things we need on this page are:
- a quick summary of recovery efforts to date (google caching of pages and talk pages, FTP uploading, filtering, auto-updating, and the subsequent mess)
- A list of remaining problems, including:
- list of pages and talk pages that couldn't be recovered from Google. We should snag the latest versions from Wikia if they're old enough.
- if possible, list of pages that didn't auto-update
- list of stuff to be auto-deleted, like (UT) disambiguation pages.
- problems on the pages that did update. So far this seems to be templates (factions, navigation, stubs, notices, etc.) and references.
- list of images that need their descriptions, credits, copyrights and categories restored.
- succinct summary of what's to come. Can we fix the problematic stuff (templates, etc.) with scripts? If so, awesome; if not, we need a master list to start divvying up the workload again.
Everything else, including this section, can go in an archive. -- Repowers 13:03, 20 March 2009 (EDT)
- All right, I started an attempt at an outline. (It should be taken as what will be left over when everything else is removed to the archive, which is why the headers are done the way they are.) --Jeysie 18:16, 20 March 2009 (EDT)
- Shouldn't the outline be moved off this talk page to Transformers Wiki:Community Portal/Damage Control Central? - SanityOrMadness 19:32, 20 March 2009 (EDT)
Summary Outline (remove this header after archiving everything else)
Summary of Events So Far
- For further information, see: Transformers Wiki:Bookworm Virus
(Needs to be added)
Google Cache Pages
Saved
- Main
- Template
- Help
- Transformers Wiki
- Talk 0000-0499, 8600-8921
Exceptions
- Akari Hibino - 100% -Derik
- All Fall Down (disambiguation)
- Charlie Bodin
- Cheyne
- Crown
- Cybertron Redux
- Deep Sea Discovery! Vector Prime
- Dirt Boss (disambiguation)
- Do Over
- El Greco
- Empire7
- End of the Road (Titan)
- Enter, Lio Convoy Typhoon
- Esther Scott
- Frederic Doss
- Fushigi Yamada
- Geonosis
- Gods? Devils? The Pretenders
- Gutcruncher (G1)
- Hasbro Q&A March 2009 question submissions
- High Beam - 100% -Derik
- Hochu Otsuka
- Insecticon Attack!
- Kazuhiko Gōdo
- Kingbolt - 100%
- Linkage Part 13 - 100%
- List of computers - 100%
- Lloyd Goldfine
- Matrix Quest (disambiguation) - 100% -Derik
- Matrix Quest (Titan)
- May 26
- Maya Klayn
- Mini-Vehicle
- Mirage (disambiguation)
- Miroslaw Neinert
- Nightscream (Energon)
- Nokia-bot
- Null ray
- Odette Yustman
- Optimus Primal toys
- Optimus Prime (Animated) - 100% -abates
- Quick Bow
- Robot Masters (cartoon)
- Robot Masters (pack-in comic)
- Scoop (G1) - 100% -Derik
- Scoop (Shattered Glass)
- Seven
- Showdown (disambiguation)
- Slaves of the Insecticons
- Staff Sergeant Tracy
- Stag Swamp
- Stalker (G.I. Joe)
- Studio 4˚C
- Tentakil (disambiguation)
- Terrorcon (disambiguation) - 100% -Derik
- The Big Book of Coloring Fun
- The New Battle Begins!
- The Smashing Pumpkins
- The Treacherous Attack of the Decepticons
- The Victory Warriors Enter!! - 30% (stub) -Derik
- Thundercracker (Shattered Glass) - 100% -Derik
- Tight Shot
- Tlaloc - 100% -Derik
- Tsutomu Kashiwakura - 100% -Derik
- Vector
Not Saved
- Talk 0500-8599
- Template talk
- Help talk
- Transformers Wiki talk (non-Community Portal pages)
Restoration
Useful Resources
- Processed pages
- Unprocessed cache files
- Online folder of unprocessed cache files
- 7-Zip (for opening archived files)
- Abates' online cache page processor
- Derik's online cache page processor
List of Non-auto-updated Pages
(Needs to be added)
Images
(Needs to be added)
Things That Can Get the "Speedy Delete" Template
(Needs to be added)
Problems You're Likely to Run Into
- Re-templating. Lots of it.
- References will need to be redone by hand: put empty <ref></ref> tags in the proper places in the article, move each actual reference from under the Footnotes/References section into its ref tags, then replace the leftover messed-up code with the <references/> element. You may want to check against the unprocessed cache page to make sure it's all correct.
Template Code You Might Need
Breaking Lists Into Columns
{{columnlist|(number of items per column)|
(list of items)}}
Comic Issues
Items between "coverdate" and "continuity" should be customized to match the issue's actual credits and wikilinked to the person's name.
{{Comicstory|
|seriesissue=(wikilinked comic series the issue is part of)
|prev=(previous issue)
|next=(next issue)
|seriesissue2=(wikilinked second comic series the issue is also part of, if applicable)
|prev2=(previous issue in the second series, if applicable)
|next2=(next issue in the second series, if applicable)
|title=(title of the issue, if applicable)
|image=(cover image)
|caption=(caption for the cover image)
|publisher=(wikilink to the publisher)
|date=(month, day, and year the issue was released)
|coverdate=(month and year on the issue itself)
|writer=
|penciler=
|inker=
|colorist=
|letterer=
|editor=
|continuity=(wikilink to the continuity the issue takes place in)
}}
Episodes
Items between "production company" and "animation studio" should be customized to match the episode's actual opening credits and wikilinked to the person's name.
{{episode|
|series=(series the episode is in (codes: g1, headmasters, masterforce, victory,
zone, g2, beast wars ii, beast wars neo, beast machines, rid, armada, energon,
cybertron, animated))
|ep=(the episode number)
|prev=(previous episode)
|next=(next episode)
|series2=(second series the episode is in, if applicable (codes: g1, headmasters,
masterforce, victory, zone, g2, beast wars ii, beast wars neo,
beast machines, rid, armada, energon, cybertron, animated))
|ep2=(the second series episode number, if applicable)
|prev2=(previous episode in the second series, if applicable)
|next2=(next episode in the second series, if applicable)
|title=(title of the episode)
|japanese=(title in Japanese for Japanese-origin episodes, if applicable)
|romaji=(romanization of Japanese title)
|translation=(translation of Japanese title)
|image=(some representative image)
|caption=(caption for the image)
|production code=(episode's production code, if known)
|production company=(wikilink to the company that produced the episode)
|writer=
|director=
|animation studio=(wikilink to the studio that animated the episode)
|airdate=(the date the episode aired)
|continuity=(wikilink to the continuity the episode takes place in)
}}
Featured Characters
First column is "Autobots", second is "Decepticons", third is "Humans", fourth is "Others", fifth is "Misc." These headers can be changed if needed by using |h1=(new header)|c1= in place of the c1=. Code sections with no characters in them can be deleted.
{{featuredcharacters
|c1=
|c2=
|c3=
|c4=
|c5=
}}
"For Further Information"
Used whenever an article has had some of its sections separated into separate articles.
{{see|(title of separated section page)}}
Image Galleries
Used to format lots of images into a swanky-looking table.
<gallery>
Image:(filename)|(caption)
</gallery>
"See Main Article"
Whenever an article has had some of its sections separated into separate articles, this should be placed at the top of the separate articles to link them back to the page they were split from.
{{main|(title of main article)}}
Specifying How an Article Should Be Sorted in Categories
For instance, if the article is named "Simon Furman", but you want it sorted as "Furman, Simon" in all categories.
{{DEFAULTSORT:(altered title)}}
Speedy Delete
Slap this on any page that can definitely be deleted. Needless to say, don't stick this on a page unless you're absolutely certain it can be deleted!
{{speedy|(reason for delete)}}
Stubs
- General: {{stub}}
- Character pages: {{charstub}}
- Character pages that need fiction: {{charstubfiction}}
- Character pages that need toys: {{charstubtoys}}
- Comic issues: {{issuestub}}
- Episodes: {{epstub}}
Year Page Navigation
Puts a nice navigation box on the top of year pages.
{{yearofthe|(year)}}
Stuff Still to Come
(Needs to be added)
Discussion
Probably the wrong place, but I have to ask this of those that handle coding/templates: Is it possible to fix the Recent Changes page to have the "View (newer 500) (older 500)" usable again? Just a thought. --Lonegamer78 22:18, 20 March 2009 (EDT)
So... how does this work? Is it basically if I see a garbled page I should feel free to clean it up? - Starfield 00:02, 21 March 2009 (EDT)
- Pretty much. If you see something that's really bad (like the code's all &lt; and &gt; and stuff), try downloading the zip file of unprocessed cache files, going to Abates' converter, choosing the appropriate cache file, and running it through again. That should make it a little more manageable. --Jeysie 00:48, 21 March 2009 (EDT)
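For anyone scripting their own cleanup, here is a minimal sketch of undoing the escaped-markup symptom described above. It assumes the damage is plain HTML entity escaping, possibly applied more than once; the function name `unescape_wikitext` is made up for illustration and is not part of either converter mentioned on this page.

```python
import html


def unescape_wikitext(text: str) -> str:
    """Turn escaped markup such as &lt;ref&gt; back into <ref>.

    Note: html.unescape converts every entity it knows, not only
    &lt; and &gt;, so review the result before saving a page.
    """
    return html.unescape(text)


# Hypothetical sample: one reference escaped once, one tag escaped twice.
sample = "&lt;ref&gt;Some source&lt;/ref&gt; and &amp;lt;br&amp;gt;"
once = unescape_wikitext(sample)   # the doubly escaped tag is still escaped
twice = unescape_wikitext(once)    # a second pass finishes the job
print(once)
print(twice)
```

If a page was escaped twice, a single pass leaves a layer of entities behind, so running the function a second time may be needed.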
Lost articles
I ran my updated script over all of the cache files, so I could restore the ones which didn't update properly the first time. However some of the cache files are blank. Crossed out ones I have redundant copies of.
- Destined Confrontation - The Children of Good and Evil
- Endgame Pt. I: The Downward Spiral
- Endgame Pt. II: When Legends Fall
- Endgame Pt. III: Seeds of the Future
- Generation_2
- Genesis: The Art of Transformers
- 100% compliant
- Heroic Legend
- Revelations Part I: Discovery
- Revelations Part II: Descent
- Revelations Part III: Apocalypse
There were a couple of others, such as the Q&A which are already on the site, and I'm not sure which of the several "Heroic Legend" articles that one file is for, but perhaps someone has a copy of the other three? --abates 18:20, 20 March 2009 (EDT)
- The big unprocessed cache zip has:
- Heroic Legend: 2010 Wars
- Heroic Legend: Head On, Master Warriors!
- Heroic Legend: Optimus Prime VS Megatron!!
- Also, considering that the "proper" Generation 2 article is a redirect, and it looks like we have all of the other main G2 articles, I don't think we need to worry about that one. --Jeysie 18:50, 20 March 2009 (EDT)
- Cool! And I just grabbed a copy of Destined Confrontation out of MSN's cache, so consider this section closed! --abates 20:54, 20 March 2009 (EDT)
I have no idea where to ask about this
Are we able to rescue stuff from namespace pages, like sandboxes? I had an article for Bata, maker of licensed transformer footwear, almost ready to go before the thing. Hooper_X 09:38, 21 March 2009 (EDT)
- I was able to salvage the sandboxes that I made links to on my userpage-- because when google spidered my userpage, it saw links to them and spidered them.
- If not... -Derik 19:41, 22 March 2009 (EDT)
Template Help
So the second series on "A Heroic Battle" just isn't showing up. Did I do the code wrong (although I could swear it looks right), or is the template still a little messed up?
- Never mind, I figured it out, although I should note that RiD, at least, doesn't seem to have the auto-prev and auto-next set up. --Jeysie 01:43, 21 March 2009 (EDT)
Also, I'm still curious about the Wookiepedia template... should I just erase it if I come across it, for now? --Jeysie 01:22, 21 March 2009 (EDT)
I see some of the spiffy new templates aren't working, like the new series of nameless characters templates (noname-nickname, etc.) and the faction icons. When I fix a page, should I restore those with the anticipation that the templates will return? - Starfield 12:34, 21 March 2009 (EDT)
- That's what I've been doing. At least with the factions one (hopefully I got the faction names right!) Unfortunately I wasn't that familiar with the new nameless character templates, so I've been leaving them as "noname", and I guess once the templates have been sorted out, they'll be easy to spot by looking at the pages which link to noname. --abates 23:20, 21 March 2009 (EDT)
When a page's code is totally effed...
I've discovered that when a page's code is totally effed by the import script, it can usually be reconditioned by running it through my page filter script. (located here)
The trick is to hit "Wikify" not once, but TWICE. The second round generally sorts out most of the effed-uppingness. -Derik 12:38, 21 March 2009 (EDT)
- I tried that with Student and nothing changed. Are there different kinds of effed-ness? Thylacine 2000--74.73.131.210 14:11, 21 March 2009 (EDT)
- I accuse you of doing it wrong. It made a significant difference when I did it.
- Open the student page in editing mode.
- Copy the text out.
- Paste the text into the top text field of the Rewiki Text Page
- Click "Wikify"
- Click "Wikify" again
- Copy the text back out of the bottom text field, and put it into the student article.
- Preview
When I did this, Student went from being a total mess to something with sections, working links and a garbled skeleton that was clearly a notice box at the top of the page. -Derik 14:41, 21 March 2009 (EDT)
The big sweep
Okay- Deceptitran has a list of ~24,000 pages the wiki either has or thinks it OUGHT to have. (There's holes since this was compiled from imperfect lists, but it's what we've got.)
I'm getting ready to start coding a slow sweep over all the pages-- this will take several DAYS. The obvious edit to make is the removal of all the wikia links (which google has begun to notice.) Can anyone think of any other simple edits that should be added in that mix? (I want to be conservative and not fuck up the code even MORE than it already is.)
I'm probably gonna run a passive (no edits, much faster) sweep first and just try to figure out what pages are missing. -12:52, 21 March 2009 (EDT)
- Big sweep is ~2% complete. Loss rate is around 1%. Good news-- most of the "lost" pages are false-positives. (Crap that got onto our master page-lists by accident.) Actual flat-out missing pages seem closer to ~0.3%.
- (Of course, the estimate of what's missing is only as good as our original lists. And if they were on the original lists-- then they should have gotten backed up and restored, right?)
- It's something anyway. I'll post an actual list when the sweep finishes. -Derik 13:23, 21 March 2009 (EDT)
Excellent. Excellent excellent. There has just been waaaay too much screwed up stuff to take it on by hand. This is good to hear. -- Repowers 14:28, 21 March 2009 (EDT)
Can anyone think of any other simple edits that should be added in that mix?
- Perhaps, for the sake of argument, replace &lt; and &gt; with < and >? I realise the result wouldn't be ideal, but it would paper over a lot of cracks in one fell swoop. - SanityOrMadness 18:47, 21 March 2009 (EDT)
- Getting the last of the Wikia links? I'm still finding 'em.--RosicrucianTalk 19:31, 21 March 2009 (EDT)
- I second that removing Wikia links and converting &lt; and &gt; are the two big ones. Converting over the simpler/more standard templates would also be a plus, but I'm not sure how possible that is without risking further mangling. --Jeysie 22:00, 21 March 2009 (EDT)
- Any line which looks like this:
<div style="border: 2px solid rgb(238, 238, 238); margin: 0pt auto 0.2em; padding: 2px; width: 90%; background-color: rgb(239, 239, 239); text-align: left;">'''Note:''' ''...''</div>
- ...can be turned into {{note|...}}. There are a lot of these. --abates 23:50, 21 March 2009 (EDT)
- That's nice-- except that's not what the Note template actually LOOKS LIKE in the page text. Either it's pure HTML, or the HTML's been escaped or... something, because Deceptitran isn't finding any instances of it when I run tests.
- Can someone find me an example page with the screwed-up "Note" template on it (currently or just in history) so I can figure out how to code a fix? -Derik 19:54, 22 March 2009 (EDT)
- For instance: <div style="width: 90%; margin: 0 auto .2em auto; background-color:#efefef; border: 2px solid #eeeeee; padding: 2px; text-align: left;">'''Note:''' ''[[Defiance]] changes the past relationship between Megatron and Optimus Prime as previously seen in IDW's Prime Directive prequel where they ruled side by side as equals. As presented here, Defiance establishes Optimus Prime as merely the leader of the Cybertronian science sector, and an underling to Megatron.''</div> --Jeysie 20:42, 22 March 2009 (EDT)
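As a rough illustration of the div-to-{{note}} rewrite discussed above, here is a minimal Python sketch. It assumes the note appears in page text as an unescaped `<div style="...">` wrapper, as in the two examples quoted in this thread, and it deliberately ignores what the style attribute says, since the two examples list their CSS properties differently. The names `NOTE_DIV` and `fix_notes` are invented for this sketch and are not Deceptitran's actual code.

```python
import re

# Matches <div style="anything">'''Note:''' ''body''</div>.
# The style value is not inspected, so property order does not matter.
NOTE_DIV = re.compile(
    r"<div\s+style=\"[^\"]*\"\s*>\s*'''Note:'''\s*''(.*?)''\s*</div>",
    re.DOTALL | re.IGNORECASE,
)


def fix_notes(wikitext: str) -> str:
    """Replace each matching note div with {{note|body}}."""
    return NOTE_DIV.sub(
        lambda m: "{{note|" + m.group(1).strip() + "}}", wikitext
    )


example = (
    '<div style="width: 90%; margin: 0 auto .2em auto; padding: 2px;">'
    "'''Note:''' ''[[Defiance]] changes the past relationship.''</div>"
)
print(fix_notes(example))
```

One caveat: a note body containing a bare `|` or `=` would need extra handling before being passed as a template parameter, and escaped HTML would have to be unescaped first.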
Sweep results
*yawn* Good nap. 227 missing pages-- some of which are junk and not actually missing, some of which are stuck on Markons, etc. I see that some of this has been recovered while I slept, yay!
- '
- "
- (
- Ireneusz Załóg
- Jörmungandr
- Pick-Up (Bat-Robô)
- Rio Gráfica Editora
- 1
- 1984_media
- 2
- 3
- 4
- 6
- 9
- A
- All Fall Down (disambiguation)
- Banjō Ginga
- Binbōgami
- Būpink
- Bumblespud
- C
- Charlie Bodin
- Cheyne
- Chōkon Power
- Crown
- Cybertron (planet)/Gallery
- Cybertron Redux
- Daisuke Gōri
- Dark Designs (Titan)
- Destined Confrontation: The Children of Good and Evil
- Do Over
- Doryū
- DreamMix TV World Fighters
- Dreamwave Generation One continuity
- Drench (Shattered Glass)
- Drift (episode)
- Drill Bit (Universe)
- Dropbox
- Dropshot (Shattered Glass)
- Dublin James
- Duel in the Labyrinth
- Dungeons & Dinobots
- Durguth
- Dutch
- Dynamo
- E
- Each One Fights...
- Echo Across the Galaxy! Bell of Love!!
- Empire
- End of the Maximals!?
- End of the Road (Titan)
- Enter, Lio Convoy Typhoon
- Esther Scott
- F
- Fight! Super Robot Lifeform Transformers: The Comics
- Frederic Doss
- Fushigi Yamada
- Gairyū
- Geonosis
- Glen's grandmother
- Gōryū
- Grimlock (G1) toys
- H
- Hasbro Q & A
- Hasbro Q&A December 2008: Answers
- Hasbro Q&A December 2008: Question submission
- Hasbro Q&A March 2009: Answers
- Hasbro Q&A March 2009: Question submission
- Hitoshizuku Amaō
- Hōchū Ōtsuka
- Hoshinochō
- Hozumi Gōda
- I
- Image:Autobots+Lets+Make+Out-1836.jpg
- Image:Autobots+Lets+Make+Out-6341.jpg
- Image:Autobots+Lets+Make+Out-7270.jpg
- Image:BergerInc_Tanks&SolarPlant.jpg
- Image:Blasterbluesop&mdance.jpg
- Image:Coredevastator&prime.jpg
- Image:Coremegs&wheeljack.jpg
- Image:Dinobotislandratch&spark.jpg
- Image:G1_-_Fastlane_&_Cloudraker_-_Boxart.jpg
- Image:Gambitastro&crystals.jpg
- Image:Gambitastrotrain&scream.jpg
- Image:Gambittitans&cosmos.jpg
- Image:Insectsyndromeprime&megs.jpg
- Image:Jose+santacruz+menor.jpg
- Image:Maketrackshoist&huffer.jpg
- Image:Nergill_&_Troops.jpg
- Image:Omegasupreme&creature.jpg
- Image:Search4atelita&prime.jpg
- Image:Tfalost&foundbulkheadhocky.gif
- Image:Tud2bee&spikeembrace.jpg
- Image:Universe_Soundwave&Space-Case_toy.jpg
- Industrial Light & Magic
- Insecticon Attack!
- J
- J. Falconer
- Jirō Saitō
- K
- Kakuryū
- Kazuhiko Gōdo
- Kenichirō Tanabe
- Kensō Katō
- Kenyū Horiuchi
- Kōhei Kowada
- Kōichi Tōchika
- Kōji Totani
- Kōji Yusa
- Kōki Kataoka
- Kōmoribreast
- Kōzō Shioya
- L
- Lloyd Goldfine
- M.A.R.B.
- Mantarō Iwao
- Masaharu Satō
- Masashi Endō
- Matrix Quest (Titan)
- Matrix Quest(Titan)
- May 26
- Maya Klayn
- Meltdown's Experiments
- Mini-Vehicle
- Mirosław_Neinert
- Momotarō
- Monster Maezuka
- Mujō
- Naoki Tatsuta
- Nobuyuki Saitō
- Odette Yustman
- Off-Road Cycle
- Ohio
- Omega Sentinel (disambiguation)
- Omega Terminus
- Onslaught (ROTF)
- Open Fire!
- Optimus (episode)
- Optimus Prime (Armada) toys
- Optimus Prime (Movie) toys
- Optimus Prime cookie jar
- Optimus Prime Oral Care Station
- Osamu Saka
- Overbite (Shattered Glass)
- Overlord - Terror of the Chōkon Tornado
- Overlord (rank)
- Owen Hurley
- P
- Pacific Ring of Fire
- Pack-in material
- Package art/Gallery
- Packaging
- Palace
- Panini
- Panini Armada issue 2
- Panini Armada issue 3
- Panini Armada issue 4
- Panini Armada issue 5
- Panini Armada issue 6
- Panini Armada issue 7
- Panini Armada issue 8
- Panini Armada issue 9
- Paper Magic Group
- Paradron communicator
- PARD
- Pīpō
- Press Release: Teletraan-1 Wikia moves to TFWiki.net
- Professor Gō
- Q
- Quick_Bow
- R
- Rairyū
- Ratchet's_EMP_generator
- Reverb_(Cybertron)
- Robot_Masters_(cartoon)
- Ryō Naitō
- Ryōichi Tanaka
- Ryōka Yuzuki
- Sanryō Odaka
- Save_the_Little_Girl!_The_Chōjin_Warriors,_the_Godmasters
- Scoop_(Shattered_Glass)
- Seizō Katō
- Shinichirō Miki
- Shizuku Amaō
- Shōhei Kohara
- Shōji Kawamori
- Shōki
- Showdown_(disambiguation)
- Shūta Gō
- Slaves_of_the_Insecticons
- Smith_(disambiguation)
- Staff_Sergeant_Tracy
- Street_Action_Mini-Con_Team_(Armada)
- Takurō Kitagawa
- Talk:Hot_Shot_(Unicron_Trilogy)
- Teiyū Ichiryūsai
- Teletran-1:_The_Transformers_Wiki:About
- Template-episodenav-dev
- Tesshō Genda
- The_Big_Book_of_Coloring_Fun
- The_New_Battle_Begins!
- The_Smashing_Pumpkins
- The_Treacherous_Attack_of_the_Decepticons
- Tight_Shot
- Toaster_bot
- Transformers_Mix_&_Match
- Transformers_Wiki_talk:Community_Portal/Cache_Recovery
- Transformers_Wiki:WikiaBotTests
- Transformers:_Binaltech_&_TF_Collection_Complete_Guide
- Trevor_Hutchison
- Ultra_Magnus_(Universe_Spy_Changer)
- Ultra_Magnus...to_the_Rescue?
- Units_of_time/Continuity
- User:M_Sipher/Sandbox
- Vector
- Yōichi Kobiyama
- Yōji Ietomi
- Yōko Kawanami
- Yokuryū
- Yūgo Ōgami
- Yūji Kishi
- Yūji Mikimoto
- Yūki Ōshima
- Yūsaku Yara
- Yūto Kazama
- Yukiyoshi Ōhashi
- Yumi Tōma
- I've recovered a bunch of those from Google's cache. Many of them are malformed because they have accents (I already went through and reuploaded as many of the accented articles as I could find). A chunk in the middle from Off-Road_Cycle to PARD are in G1MarvelBlaster's saved cache stuff. Mini-Vehicles is at Mini Vehicles, Ultra Magnus...to the Rescue? is at Ultra Magnus... to the Rescue?. I started trying to look for the remaining legit lost articles on yahoo and MSN, but I don't have any more time to work on it tonight. One of the message board people who saved stuff may have some of them? --abates 04:06, 22 March 2009 (EDT)
Filtering the sweep results
So, just from that list, once you take out (1) the links which have gone blue and (2) obvious errors of one sort or another (as well as the single-character stuff, the Teletran-1...About page was deleted after the big crash, f'rinstance. I stuck a {{speedy}} on it myself), that leaves these as actual pages which are completely missing, yes? [Not swearing none of these were redirects/etc - if they were, please send them to where they're meant to go and remove them from this list, huh? I've left the disambigs, tho.]
- Charlie Bodin (minor role in live-action movie - http://www.imdb.com/name/nm1835107/ )
- Cheyne (Unrecoverable. Yahoo doesn't have cache, but a search says "The planet Cheyne was one of several worlds explored by the Autobots ... Retrieved from "http://tfwiki.net/wiki/Cheyne" Categories: Generation 1 | Planets ...")
- Crown (unrecoverable)
- Digital Dagger (Unrecoverable. T.E.C.H. toy from first movie's toyline)
- End of the Maximals!?
- End of the Road (Titan) (Unrecoverable.)
- Esther Scott
- Frederic Doss
- Fushigi Yamada
- Geonosis
- Hasbro Q&A March 2009: Question submission
- Insecticon Attack!
- Kazuhiko Gōdo
- Lloyd Goldfine
- Matrix Quest (Titan) (Unrecoverable.)
- May 26 (Unrecoverable. Yahoo gives it as a result in a search, but no useful info)
- Maya Klayn
- Odette Yustman
- Robot Masters (cartoon)
- Scoop (Shattered Glass)
- Slaves of the Insecticons
- Smith (disambiguation)
- Staff Sergeant Tracy
- The Big Book of Coloring Fun
- The Smashing Pumpkins
- The Treacherous Attack of the Decepticons
- Tight Shot (Unrecoverable. T.E.C.H. toy from first movie's toyline)
- Ultra Magnus (Universe Spy Changer)
- Vector
- Yukiyoshi Ōhashi
63 pages as I type... obviously, this doesn't take account of stuff which exist but was reverted, but... - SanityOrMadness 16:42, 22 March 2009 (EDT)
- There are certainly more-- this list was generated from lists of pages from ABates and FortMax that we think we're SUPPOSED to have (and thus pages we were trying to save the cache of). Literally this is more a compilation of "Stuff we KNEW we had to save that we either fucked up on, or there was no cache of."
- If a page wasn't on the lists in the first place (for a variety of reasons) then it's not gonna be there. Sorta "Known Unknowns" vs. "Unknown Unknowns." -Derik 18:27, 22 March 2009 (EDT)
- Yeah, I figured that out pretty quickly after I typed that when I noticed the number of redlinks on extant pages vs. blue links on cached pages, especially around the Japanese stuff (half of the Beast Wars Neo episode articles appear to be gone, for instance). - SanityOrMadness 18:33, 22 March 2009 (EDT)
- Lonegamer has many (if not all, I dunno) of the Beast Wars Neo episodes in her saved caches, as well as lots of other Japanese article-related stuff, as I posted here earlier, for those folks who typically work on the Japanese articles. *hint*nudge*kick* --Jeysie 19:20, 22 March 2009 (EDT)
- The vast majority of the Neo episode articles were just skeletons and are available on the Other Place. It's not really that big of a deal. —Interrobang 20:55, 22 March 2009 (EDT)
And so it Begins...
Deceptitran has begun an edit sweep. I'm calling this "Pass 1." He will ATTEMPT to:
- Remove the wikia spam
- Fix the {{factions}} template
- properly format the {{disambig2}}'s
- Fix {{note}}s
This is going to be at least fourteen thousand edits, minimum. It's gonna take awhile. (The 'what's missing' sweep was 20-100 times faster because it didn't have to post edits.) -Derik 23:11, 22 March 2009 (EDT)
- While I think of it, I'm gonna add those filters to my page-fixer tool too. Let me know if that causes it to glitch. -Derik 23:21, 22 March 2009 (EDT)
- On-the-fly change-- Deceptitran is now also fixing links like [[Slumdog|Slumdog]] to simply read [[Slumdog]]. (The import script rendered everything as a two-part link.)
- I'll go back and re-check the 500-or-so articles already edited for this after. Note to self: The articles I want to re-check are the ones lacking flag2 in the database. -Derik 00:10, 23 March 2009 (EDT)
- Three little points, based on a tiny sample of Deceptitran's edits:
- Careful which side of the [[x|x]] links you grab. The general rule seems to be that the left-hand side always has the first letter capitalised, and I've seen a couple of pages where [[Continuity family|continuity family]] has become [[Continuity family]] when the "c" should have stayed small.
- Just inadvertently confirmed, because Deceptitran filtered THIS page! (i.e., the "links" above. I reverted :)) It indeed took the capitalised left-hand side. - SanityOrMadness 01:13, 23 March 2009 (EDT)
- Can you grab the equivalent {{storylink}}s while you're at it? There's a lot of {{storylink|x|x}}s too for much the same reason.
- On Buzzsaw (Cybertron), the {{note}} initially got messed up. I think it's an edge case, since others seem to have gone through okay, but something to watch if you haven't fixed it.
- Otherwise, keep up the good work :) *wonders aloud if this might fix the article-counting problem - if a page gets edited, or even null-edited, it gets readded to the numbering if it's slipped out, doesn't it?* - SanityOrMadness 00:25, 23 March 2009 (EDT)
- What voodoo are you using to spot [[Slumdog|Slumdog]] links? I couldn't get that one worked out. --abates 02:37, 23 March 2009 (EDT)
- I was using a callback-- /\[\[(.+)\|(\1)\]\]/i --> $2.
- Find all links where the second term is the same as the first term, using case-insensitive match, and use the second term. (I was originally using the first, oops.)
- Problem is I eventually realized that would turn links to [[Energon Cube|energon cube]] into [[energon cube]], and only the first letter can be insensitive. :p
- Now I'm using a callback function. Like killing flies with a nuke. (And I'm gonna use this page to test that callback.) *sets up some bad links.* -Derik 03:42, 23 March 2009 (EDT)
- energon cube - no
- energon cube - yes
- energon Cube - yes
- energon Cube - yes
- energon Cube - no
- Woohoo! It parsed 'em all right, we are good to go!
- It looks like I ran about 700 pages through the bad version of the filter but... I have a hard time making myself care. That's about 1.2% of our pages, and 98% of the time the difference between the two types of links will be nil or cosmetic. It's right going forward, but I'm not going back to fix the others.
- Also, I looked at the buzzsaw link you posted-- the {{note}} part got rewiki'd perfectly! It's just that it had a leftover HTML link in it!
- There's like 50 variants of <a href=""> on the site right now because browsers tended to save the files as THEY thought they looked.
- Open a page in Firefox and hit "view source." Now highlight a chunk of a page and right-click "view source." THE CODE IS DIFFERENT. The right-click gives you the HTML as it's been shuffled around by the Gecko rendering engine, which puts all the classes, titles, hrefs and so on in the SAME ORDER, regardless of what their sequence was in the original HTML.
- I figure there's gonna be some MONSTER link-parsing function that takes all of this into account for another sweep after this. :p (At which point the anchor tag within that perfectly valid {{note}} template would be correctly wikified.)
- There's nothing wrong with the {{note}} filter-- it's just that it only does {{note}}s. -Derik 03:57, 23 March 2009 (EDT)
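The redundant-link fix discussed in this thread can be sketched roughly as follows. This is a Python illustration rather than the PHP callback Deceptitran actually runs. It assumes the rule stated above: two link halves count as the same title only if they match exactly apart from the case of the first character, and the right-hand (displayed) side is the one kept. The names `PIPED_LINK`, `same_title` and `collapse_redundant_links` are invented for the example.

```python
import re

# A piped link [[target|label]] with no nested brackets or extra pipes.
PIPED_LINK = re.compile(r"\[\[([^\[\]|]+)\|([^\[\]|]+)\]\]")


def same_title(target: str, label: str) -> bool:
    """True if the halves differ at most in the case of the first letter."""
    return (
        target[0].lower() == label[0].lower()
        and target[1:] == label[1:]
    )


def _collapse(match):
    target, label = match.group(1), match.group(2)
    if same_title(target, label):
        # Keep the displayed text so lower-case links stay lower-case.
        return "[[" + label + "]]"
    return match.group(0)  # a genuinely different label: leave untouched


def collapse_redundant_links(wikitext: str) -> str:
    return PIPED_LINK.sub(_collapse, wikitext)


print(collapse_redundant_links("[[Continuity family|continuity family]]"))
print(collapse_redundant_links("[[Energon Cube|energon cube]]"))
```

The second call leaves its link alone because the capital "C" in the target is not the first character, which is the case a fully case-insensitive match gets wrong. Underscore-versus-space differences are not handled here.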
Lonegamer's Cache
Lonegamer sent me all of her browser cache files the other day. I've already reposted a few I can recognize as being ones we lost, but there's some other possibly useful stuff in here, lots of Japanese episode and manga-related stuff, especially. I link to it for those who might need such things: http://miscfile.alienharmony.com/transformers/lonegamers-cache.zip --Jeysie 19:30, 21 March 2009 (EDT)
- Fantastic! I was able to use some of the files from here to fix a bunch of articles. --abates 19:33, 22 March 2009 (EDT)
G1MarvelBlaster's cache saves
It doesn't look like these were part of the original upload of files: [4]. I can't seem to download the files though. --abates 01:19, 22 March 2009 (EDT)
- I just downloaded the Omega bomb cache article. Seems to be working all right. Is there some reason why people decided not to upload the files to Scout's server so they could be restored en masse? It seems to me that some people deciding to upload files elsewhere is the reason for so many articles not being updated. --FFN 01:41, 22 March 2009 (EDT)
- I know there was a problem with Scout's FTP being down at one point, so it's possible people decided to upload elsewhere during that time. (I personally had just uploaded the pages right after saving as a backup, before Scout set up her FTP, but I offered to reupload if needed... no one told me to, however.)
- I will note that G1MarvelBlaster and other folks linked to their uploads in the original finish summary, though, so they did inform that they uploaded it somewhere else. --Jeysie 02:04, 22 March 2009 (EDT)
- I managed to get Paradron communicator out by hitting download, copying and pasting the HTML code into a file, saving the file, and then processing it. --abates 02:38, 22 March 2009 (EDT)
- So does someone want to volunteer to grab these and reprocess them? You can skip all the October files, since I think Derik did those, and several of the Optimus Prime ones have already been done. Should be around 160 in total. I'd do it, but I have 600+ articles which are messes to try to reprocess. --abates 16:23, 22 March 2009 (EDT)
Zip file
Here is a 1MB zip file containing 131 of the cache files. I've left out the ones which I know for sure have already been updated. --abates 01:16, 23 March 2009 (EDT)
Why are we blocking the Wayback Machine?
I accidentally hit the Wayback Machine rather than MSN when I was trying to grab the cache for a page there, and was informed that:
- We're sorry, access to http://tfwiki.net/wiki/End_of_the_Road_(Titan) has been blocked by the site owner via robots.txt.
Wha-huh?! - SanityOrMadness 10:34, 22 March 2009 (EDT)
- There was nothing in our robots.txt that should stop the Internet Archive from viewing us, but there also wasn't anything useful in it anyway, so I've deleted the file. That's weird. --Suki Brits 12:02, 22 March 2009 (EDT)
- It's probably Archive.org's default answer-string for "I don't have it."
- Just like the YouTube video player informs you a video is no longer available even if it's just having a timeout error. -Derik 18:22, 22 March 2009 (EDT)
- No, the default is simply "Sorry, no matches" (example) - SanityOrMadness 21:01, 22 March 2009 (EDT)
- I believe the Wayback Machine's crawlers simply look for the presence of a robots.txt and not the content of it. At least that's the impression I've gotten from using the Wayback Machine.--Tigerpaw28 13:15, 23 March 2009 (EDT)
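To check what a robots.txt file says on paper, Python's standard `urllib.robotparser` can be used, as in the sketch below. It only shows how a standards-following parser reads a given file and says nothing about how the Wayback Machine's crawler actually behaves. The user-agent string `ia_archiver` and the blocking rule are assumptions made for illustration, not the contents of the deleted file.

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_txt: str, agent: str, url: str) -> bool:
    """Parse robots.txt text and ask whether `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


URL = "http://tfwiki.net/wiki/End_of_the_Road_(Titan)"

# An empty file contains no rules, so nothing is disallowed.
empty_ok = allowed("", "ia_archiver", URL)

# A hypothetical file that explicitly shuts one crawler out of the site.
blocked = allowed("User-agent: ia_archiver\nDisallow: /\n", "ia_archiver", URL)

print(empty_ok, blocked)
```

By the rules alone, a file with nothing useful in it should not block any crawler, which matches Suki Brits's reading of the old file.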
Page shrinkage
I took a stab at cleaning up the loose code on Galvatron (G1), restoring quotes and notes and whatnot, and now the page seems to be half its normal width. Uh, help? I mentioned this in the Summary box too, but for some reason it didn't appear in Recent Changes. --Thylacine 2000 14:17, 22 March 2009 (EDT)
Too big to wikify?
I have attempted to convert Optimus Prime (Armada) a few times, as recovering that will let us split the toy section out again. It seems to choke Derik's tool, though.--RosicrucianTalk 18:52, 22 March 2009 (EDT)
- Do it section-by-section. Derik's tool DOES have an upper limit, but if you put it through in smaller chunks, it'll work. - SanityOrMadness 18:56, 22 March 2009 (EDT)
- I've restored the toy page from G1MarvelBlaster's cache version. I'd suggest getting Optimus Prime (Armada) from there too, as it looks like that might be a more up-to-date version than was originally imported here. --abates 20:29, 22 March 2009 (EDT)
Incidentally, Optimus Prime (Armada) toys is still in Google's cache in its own right. - SanityOrMadness 19:27, 22 March 2009 (EDT)
- It's in G1MarvelBlaster's archive of cache files too. --abates 19:36, 22 March 2009 (EDT)
- Yeah, my tool is kinda hacky. I didn't know about the limit... but it doesn't surprise me. I've got an AWFUL LOT of "match everything" Regex's set to Multiline mode, with callbacks. Enough KB of text would probably cause that to choke. -Derik 19:38, 22 March 2009 (EDT)
I noticed Sanity or Madness pulling his hair out over all the garbled talk pages-- both too long for my tool and indented which it doesn't handle.
Well it handles indents now. And especially or talk pages Version 3i. It just does indents and headers, so it can filter long pages. (You'll still have to break 'em up to fix everything else, but at least now they're more comprehensible when you try to.) -Derik 03:18, 23 March 2009 (EDT)
Working on scrambled pages
Some of the pages ended up completely scrambled - i.e. no HTML conversion and, you'll note, no categories. I've been restoring as many of them as I can, but some of the original cache files have really strange line breaks in them, often in the middle of HTML tags, and both my script and Derik's pretty much choke on them. I went through the articles on my list which start with the letter 'D' tonight, saving new copies of them from Google, Yahoo and MSN; however, I couldn't recover these four:
So if anyone has spare copies of these, please speak up! :) --abates 04:46, 23 March 2009 (EDT)
- Done, I think. --Jeysie 05:34, 23 March 2009 (EDT)
- Woo! Cool! Having done a mass process of the rest of the files on my list, the following three are the only ones I couldn't retrieve from caches:
- It looks like the other 510 files processed all right, so now I just have to go through them all and paste them into the articles on teh Wiki --abates 05:45, 23 March 2009 (EDT)
- What did you code your tool in anyway? Do you want copies of my PHP callbacks?
- I really think Sweep #2 is just going to require a "Find an <a> tag, now pick it apart and put it back together no matter what order the fucker is in." function. Admittedly, this is more-or-less how I wrote the indenter-- it's a progressive parser, just like the Bad Old Days of write-it-yourself XML parsers. -Derik 06:40, 23 March 2009 (EDT)
- It's coded in Perl. Sure, a copy of the callbacks would be useful, thanks!
- I never could get the indenting right - it doesn't help that the HTML MediaWiki generates doesn't seem to close its <li> tags all of the time. --abates 07:02, 23 March 2009 (EDT)
- This is PHP not perl, but the basic concept applies...
function fixIndents($wiki_text){
    $lines = explode("\n", $wiki_text);
    $new_text = '';
    // The four tags to look for, keyed by what each one does to the indent state.
    $options = array( 'down'    => '<dl>',
                      'up'      => '</dl>',
                      'new_sib' => '<dd>',
                      'end_sib' => '</dd>'
    );
    $indent = 0; // Current <dl> nesting depth; carries over from one line to the next.
    foreach($lines as $line){
        $evaluate_line = true;
        $offset = 0;
        while ($evaluate_line){
            // Find whichever of the four tags occurs first in the line.
            $position = null;
            $mode = null;
            foreach ($options as $mode_type => $option){
                $loc = strpos($line, $option, $offset); //I'm pretty sure $offset is always 0 when it hits this line.
                if ($loc !== false){
                    if (($position === null) || ($loc < $position)){
                        $position = $loc;
                        $mode = $mode_type;
                    }
                }
            }//End Options
            if ($position !== null){
                //echo "<p>$mode</p>";
                $offset = ($position);
                switch ($mode){
                    case 'down':
                        $indent++;
                        $replacement = '';
                        break;
                    case 'up':
                        $indent--;
                        $replacement = '';
                        break;
                    case 'new_sib':
                        // Never pad with a negative count if the HTML had stray closing tags.
                        $indent2 = $indent;
                        if ($indent2 < 0) $indent2 = 0;
                        $replacement = str_pad('',$indent2,':');
                        break;
                    case 'end_sib':
                        $replacement = '';
                        break;
                }
                // Swap the tag for its wikitext equivalent, then rescan the line from the start.
                $line = substr_replace( $line, $replacement, $offset, strlen($options[$mode]) );
                $offset=0;
            } else {
                $evaluate_line = false;
                // echo $line;
            }
        }//Endline
        //$problems = array();
        //$problems[strpos($line,$begin_parent)];
        $new_text .= $line . "\n"; // explode() dropped the newlines, so put one back after each line.
    }
    $wiki_text = $new_text;
    $wiki_text = str_replace( '</p><p>', "\n\n", $wiki_text );
    $wiki_text = str_replace( '<p>', '', $wiki_text );
    $wiki_text = str_replace( "\n</p>", '', $wiki_text );
    return $wiki_text;
}
I break it up line-by-line just to reduce overhead.
Then I scan for DL's and DD's, looking for which is "next." If I encounter a DL, I add one to the number of indents. (DL means you're going 'down' a level, /DL means you're going up one.) The DL and /DL tags themselves you can just erase then.
Then each DD represents the start of a new line, which has to be indented with the correct number of :'s (which has been going up and down every time we ran into a DL or /DL.) So I replace the DD with, for example, "::::" (four indents.) You can just erase the /DD's.
It sounds really stupid, but it works perfectly. My messy code is really just a reflection of the fact it has to be de-parsed using a state machine instead of stateless regular expressions. -Derik 09:37, 23 March 2009 (EDT)
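For anyone who wants to try the idea without the rest of the tool, the DL/DD walk described above can be condensed into a short standalone sketch. This is not Derik's code: `dlToColons` is a made-up name, and splitting the text into tokens with `preg_split` is a swapped-in technique in place of the repeated `strpos` scan. It assumes the tags appear exactly as `<dl>`, `</dl>`, `<dd>` and `</dd>`, with no attributes.

```php
<?php
// Hypothetical helper (not from the original tool): converts nested <dl>/<dd>
// markup into MediaWiki ':' indentation by walking the tags in document order.
function dlToColons(string $html): string
{
    // Split on the four tags, keeping each tag as its own token.
    $tokens = preg_split('/(<\/?d[ld]>)/i', $html, -1, PREG_SPLIT_DELIM_CAPTURE);
    $depth = 0;   // the only state: how many <dl> levels are currently open
    $out = '';
    foreach ($tokens as $token) {
        switch (strtolower($token)) {
            case '<dl>':
                $depth++;            // going down a level
                break;
            case '</dl>':
                $depth--;            // coming back up a level
                break;
            case '<dd>':
                // A new reply starts here: emit one ':' per open level.
                $out .= str_repeat(':', max(0, $depth));
                break;
            case '</dd>':
                break;               // closing tags are simply erased
            default:
                $out .= $token;      // ordinary text passes through untouched
        }
    }
    return $out;
}

$html = "<dl><dd>First reply\n<dl><dd>Nested reply\n</dd></dl>\n</dd></dl>\n";
echo dlToColons($html);
```

On the sample input this prints `:First reply` and `::Nested reply` on separate lines. The blank lines left behind by the closing tags would still need trimming in a real pass.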
Callbacks
Link callback
$wiki_text = preg_replace_callback('/<a href="(.+(#.+)?)"( class="(.+)")? title="(.+)"( rel="(.+)")?>(.+)<\/a>/Ui' , 'fixLinks', $wiki_text ); //$wiki_text is the page text being converted
function fixLinks($matches){
    $url = $matches[1];          //Full href (not used below)
    $anchor = $matches[2];       //The "#Section" part of the href, if any
    $specialClass = $matches[4]; //e.g. "external text"
    $title = $matches[5];        //Page title (or the URL, for external links)
    $prettyText = $matches[8];   //The visible link text
    $returning = '';
    if (($title.$anchor) == $prettyText){
        $returning = "[[$prettyText]]";
    }else{
        //Text is different
        $returning = "[[$title$anchor|$prettyText]]";
    }
    if ($specialClass == 'external text'){
        $returning = "[$title $prettyText]";
    }
    return $returning;
}
/* ===========================================================================
Note that this will still return [[Energon cube|energon cube]] because it doesn't
treat the first character as caseless-- but the NEXT callback will fix that...
This function will also correctly parse Wikipedia links, Wookiepedia links, and
external links. Yay!
=============================================================================== */
$wiki_text = preg_replace_callback('/\[\[(.+)\|(\1)\]\]/iU' , 'fixLinks2', $wiki_text ); //\1 is a mid-pattern backreference to the first () subpattern. "If the text is the same without case, refer the link to the callback function."
function fixLinks2 ($matches) {
    $one = $matches[1];
    $two = $matches[2];
    if ( substr($one,1) == substr($two,1)){
        return '[[' . $two . ']]';
    } else {
        return '[[' . $one . '|' . $two . ']]';
    }
} //Literally just compares substrings excluding the first character, LOL.
...all 3 of these came from different script files, oddly enough. I don't have ONE script that does them all. ;)
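The "find an <a> tag, pick it apart and put it back together no matter what order it's in" pass suggested earlier in this thread might look something like the sketch below. It is only an illustration, not the actual Sweep #2 code: the function names `parseAttributes`, `anchorToWikitext` and `convertLinks` are invented here, and it assumes every attribute is written as `name="value"` with double quotes. Unlike `fixLinks`, it takes the external URL from `href` rather than `title`.

```php
<?php
// Hypothetical sketch: read an <a> tag's attributes in whatever order they
// appear, then rebuild the link as wikitext.
function parseAttributes(string $attrText): array
{
    $attrs = [];
    // Collect every name="value" pair; the order in the tag does not matter.
    preg_match_all('/([\w-]+)="([^"]*)"/', $attrText, $pairs, PREG_SET_ORDER);
    foreach ($pairs as $pair) {
        $attrs[strtolower($pair[1])] = $pair[2];
    }
    return $attrs;
}

function anchorToWikitext(array $matches): string
{
    $attrs = parseAttributes($matches[1]);
    $text  = $matches[2];
    $href  = $attrs['href'] ?? '';
    $title = $attrs['title'] ?? '';
    $class = $attrs['class'] ?? '';

    if ($class === 'external text') {
        return "[$href $text]";          // external link: [URL label]
    }
    if ($title === '') {
        return $matches[0];              // nothing to build from; leave the tag alone
    }
    // Keep any "#Section" fragment from the href, as fixLinks does.
    $hashPos = strpos($href, '#');
    $anchor  = ($hashPos === false) ? '' : substr($href, $hashPos);
    $target  = $title . $anchor;
    return ($target === $text) ? "[[$text]]" : "[[$target|$text]]";
}

function convertLinks(string $html): string
{
    // Group 1 is the raw attribute text, group 2 is the visible link text.
    return preg_replace_callback('/<a\s+([^>]*)>(.*?)<\/a>/is', 'anchorToWikitext', $html);
}

echo convertLinks('<a title="Energon cube" href="/wiki/Energon_cube">Energon cubes</a>'), "\n";
echo convertLinks('<a href="/wiki/Megatron" title="Megatron">Megatron</a>'), "\n";
```

The two sample calls print `[[Energon cube|Energon cubes]]` and `[[Megatron]]`, even though the attributes come in a different order each time. The first-letter case cleanup done by `fixLinks2` is left out, so it would still have to run afterwards.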
Header callback
$wiki_text = preg_replace_callback( '/<a name=".+"><\/a><h([123456])><span class="editsection">\[<a href=".+" title="Edit section:.+">edit<\/a>\]<\/span> <span class="mw-headline">(.+)<\/span><\/h[123456]>/Ui' , 'fixHeaders', $wiki_text ); //$wiki_text is the page text being converted
function fixHeaders($matches){
    $padding = str_pad( '', (int) $matches[1], '='); //$matches[1] is the heading level, $matches[2] the heading text
    return $padding . $matches[2] . $padding . "";
} //How many ='s? Just count the number on the tag! 1 for H1, 6 for H6.
Hope that's helpful. Is perl Ecmascript? -Derik 09:53, 23 March 2009 (EDT)
- It's from the same family tree as PHP and Ecmascript. The syntax is very similar, in fact! --abates 15:55, 23 March 2009 (EDT)
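The header callback above can be tried on its own with a cut-down pattern. The sketch below is a simplified variant, not the original. The names `headingToWikitext` and `convertHeadings` are invented, and it assumes the `<a name>` anchor and the edit-section span have already been stripped, which the real cache files would not have had done for them.

```php
<?php
// Hypothetical, simplified variant of the header callback. It only handles
// <hN><span class="mw-headline">Title</span></hN>.
function headingToWikitext(array $m): string
{
    // The digit in the tag is the number of '=' signs: <h2> gives '=='.
    $bars = str_repeat('=', (int) $m[1]);
    return $bars . trim($m[2]) . $bars;
}

function convertHeadings(string $html): string
{
    // \1 makes sure the closing tag has the same level as the opening one.
    return preg_replace_callback(
        '/<h([1-6])>\s*<span class="mw-headline">(.+?)<\/span>\s*<\/h\1>/i',
        'headingToWikitext',
        $html
    );
}

echo convertHeadings('<h2><span class="mw-headline">Fiction</span></h2>'), "\n";
echo convertHeadings('<h3> <span class="mw-headline">Toys</span></h3>'), "\n";
```

The two sample calls print `==Fiction==` and `===Toys===`.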
Faction template
Faction template is working, but someone (preferably not me) needs to go in and fill in all the icons again or it's gonna start spewing garbage all over page titles. Only a medium level of template competence is required; it's all copy-and-paste.
- Go here: Template:Factions/icons.
- Starting with the Maximal/Predacon entries as a model, add all the other factions.
You can find all the names to use and the image files that go with them, only slightly garbled, in the table here: Template:Factions.
Just copy and paste and swap out the names. Volunteers? -Derik 12:44, 23 March 2009 (EDT)
- Well, now I know why I've never seen it working... why DOES it require JavaScript? It isn't hard to specify an absolute position to place in the header using CSS alone, surely? - SanityOrMadness 12:52, 23 March 2009 (EDT)
- <Raises hand> I'll do it! Shouldn't take me more than a couple hours, if that. --Tigerpaw28 13:57, 23 March 2009 (EDT)
And it's done. All the symbols in the list Derik linked to have been added to the template. I've verified that all the images are working with one exception: the Blendtron logo. I tried the filename listed on the linked list as well as the Blendtron page and neither shows up. Also, some of the mouseover text may need to be changed in order to restore Teh Funny. I can do that myself if someone can provide me with a list of what needs to be tweaked. --Tigerpaw28 16:32, 23 March 2009 (EDT)

