Differences between revisions 58 and 59
Revision 58 as of 2013-03-31 15:52:46
Size: 3739
Comment:
Revision 59 as of 2013-03-31 15:59:17
Size: 5156
Comment:
Deletions are marked like this. Additions are marked like this.
Line 39: Line 39:
If you want to know precisely when a specific data source is updated, you can take a look at the [[http://udd.debian.org/crontabs.txt|crontab]] file. The ''timestamps'' table can tell you when a data source was last updated.
Line 41: Line 43:
You can <b>report bugs</b> against the qa.debian.org pseudo-package,
 
using the ''udd'' usertag and user ''qa.debian.org@packages.debian.org''.
You can <b>report bugs</b> against the qa.debian.org pseudo-package, using the ''udd'' usertag and user ''qa.debian.org@packages.debian.org''.
Line 50: Line 51:
 * [[http://udd.debian.org/udd.sql.gz|full dump of the database]] generated every two days. (500 MB and growing)
Line 54: Line 56:

== References ==
Even if its main goal was not to serve as a scientific resarch tool, UDD was already featured in two publications:
 * Lucas Nussbaum and Stefano Zacchiroli. '''The Ultimate Debian Database:
Consolidating Bazaar Metadata for Quality Assurance and Data Mining'''.
[[http://msr.uwaterloo.ca/msr2010/index.html|7th IEEE Working Conference on Mining Software Repositories (MSR'2010)]]
[[http://www.loria.fr/~lnussbau/files/msr2010-udd.pdf|Paper]] - [[http://www.loria.fr/~lnussbau/files/msr2010-udd-slides.pdf|Slides]] -
[[http://hal.archives-ouvertes.fr/inria-00502886/en|HAL]]

 * Julius Davies, Hanyu Zhang, Lucas Nussbaum and Daniel M. German.'''Perspectives on Bugs in the Debian Bug Tracking System'''
[[http://msr.uwaterloo.ca/msr2010/index.html|7th IEEE Working Conference on Mining Software Repositories (MSR'2010): Mining Challenge]]
[[http://www.loria.fr/~lnussbau/files/msr2010-debianbugs.pdf|Paper]] - [[http://www.loria.fr/~lnussbau/files/msr2010-debianbugs-slides.pdf|Slides]]- [[http://hal.archives-ouvertes.fr/inria-00502883/en|HAL]]

Translation(s): English - Italiano


Ultimate Debian Database (UDD) gathers a lot of data about various aspects of Debian in the same SQL database. It allows users to easily access and combine all these data.

Data currently being imported include: Packages and Sources files, from Debian and Ubuntu, Bugs from the Debian BTS, Popularity contest, History of uploads, History of migrations to testing, Lintian, Orphaned packages, Carnivore, Debtags, Ubuntu bugs (from Launchpad), Packages in NEW queue, DDTP translations.

For more information, please contact us on #debian-qa or debian-qa@lists.debian.org (mailing list subscription and archives).

Connecting to and using UDD

udd.debian.org is running on ullmann.debian.org. It accepts direct SSLed connections from quantz (qa) and wagner (alioth).

  • command-line:
    • psql service=udd

    • or: psql -U guest -h udd.debian.org -p 5452 udd

  • Python:

    import psycopg2
    conn = psycopg2.connect(host="udd.debian.org",port=5452,user="guest",database="udd")
    cursor = conn.cursor()
    cursor.execute("SELECT count(*) from sources where release='sid'")
    print cursor.fetchall()[0][0]
  • Ruby (DBI): require 'dbi' ; dbh = DBI::connect('DBI:Pg:dbname=udd;port=5452;host=udd.debian.org', 'guest') 

  • Ruby (PG): require 'pg'; conn = PG.connect({:host => 'udd.debian.org', :port => 5452, :user => 'guest', :dbname => 'udd'}) 

If you want to know precisely when a specific data source is updated, you can take a look at the crontab file. The timestamps table can tell you when a data source was last updated.

Improving UDD

If you want to help improve UDD, you can set up your own instance as described on UltimateDebianDatabase/Hacking. You can <b>report bugs</b> against the qa.debian.org pseudo-package, using the udd usertag and user qa.debian.org@packages.debian.org.

Credits

UDD started as a Google Summer of Code project by Christian von Essen (Neronus), co-mentored by Lucas Nussbaum, Stefano Zacchiroli and Marc 'HE' Brockschmidt. It is now mainly maintained by Lucas Nussbaum, with help from others.

Other resources

See also :

References

Even if its main goal was not to serve as a scientific resarch tool, UDD was already featured in two publications:

  • Lucas Nussbaum and Stefano Zacchiroli. The Ultimate Debian Database:

Consolidating Bazaar Metadata for Quality Assurance and Data Mining. 7th IEEE Working Conference on Mining Software Repositories (MSR'2010) Paper - Slides - HAL

  • Julius Davies, Hanyu Zhang, Lucas Nussbaum and Daniel M. German.Perspectives on Bugs in the Debian Bug Tracking System

7th IEEE Working Conference on Mining Software Repositories (MSR'2010): Mining Challenge Paper - Slides- HAL

SubPages