Learn, hack!

Hacking and security documentation: slides, papers, video and audio recordings. All in high-quality, daily updated, avoiding security crap documents. Spreading hacking knowledge, for free, enjoy. Follow on .

Edit This Page

URL
http://events.ccc.de/congress/2004/fahrplan/files/290-Scaling-the-Wiki-21C3-BrionVibber.pdf
File name
290-Scaling-the-Wiki-21C3-BrionVibber.pdf
File size
1.1 MB
MD5
ecea10315875fe3dd0f8183ea901d99d
SHA1
f4ed7b8e2b1a898456e9c95d713b076ed369f422

Wikipedia developers chronicle the evolution of the MediaWiki software and server farm to manage the popular and ever-growing editable encyclopedia. Planned scalability improvements are outlined, plus developers' conference and hacking session. Since 2001, the Wikipedia encyclopedia project has jumped from a scratchpad side project to one of the top 500 sites on the web (Alexa stats), bringing community and media attention to both wikis and open-licensed content. While some dispute its quality, Wikipedia's quantity is undeniable: at over one million pages in dozens of languages Wikipedia is the largest, most populous Wiki Wiki site on the net. Where traditional wikis have tended to be relatively small communities based on some topical interest, Wikipedia actively seeks attention, visitors, and editors with an open-ended mission to document virtually any topic, from the philosophers of Athens to Slashdot trolling fads. Phenomenal growth in the editing community, non-editing visitors, and the number of topics covered has thus put continual social and technical pressure on the scalability of the system. The wiki engine MediaWiki has grown up along with the project, following the sometimes-conflicting paths of being both easy to install and use and performing reasonably well in a multimillion hit per day environment. Built in the scripting language PHP, MediaWiki attempts to boost performance with "alternate hard and soft layers" of code: most hits to Wikipedia are actually handled by a Squid reverse proxy cache which is faster than any PHP script could be. Pre-parsed page chunks and dynamically editable user interface data are optionally cached using Livejournal's distributed memory object cache memcached or the compatible disk-backed tugelacache. Ongoing work is being put into accelerated native-code diff, parsing, and Unicode normalization modules, while still retaining compatibility with "pure" PHP code for use in more restricted environments. Additionally a MediaWiki developers' conference is planned, for discussion and work on the next major version of the software which will carry sites like Wikipedia through 2005 and beyond: developers will give a peek at upcoming storage back-end improvements to handle an ever-growing text corpus.

About us

Secdocs is a project aimed to index high-quality IT security and hacking documents. These are fetched from multiple data sources: events, conferences and generally from interwebs.

Statistics

Serving 8166 documents and 531.0 GB of hacking knowledge, indexed from 2419 authors from 163 security conferences.

Contribute

To support this site and keep it alive, you can click on the buttons below. Any help is really appreciated! This service is provided for free, but real money is needed to pay bills.

Flattr this Click here to lend your support to: Keep live SecDocs for an year and make a donation at www.pledgie.com !