Article overview
A pseudo-parallel Python environment for database curation
Authors: Eckhard Sutorius; Johann Bryant; Ross Collins; Nicholas Cross; Nigel Hambly; Mike Read
Date: 13 Nov 2007
Abstract: One of the major challenges in providing large databases like the WFCAM Science Archive (WSA) is to minimize ingest times for pixel/image metadata and catalogue data. In this article we describe how the pipeline-processed data are ingested into the database as the first stage in building a release database, which is followed by advanced processing (source merging, seaming, detection quality flagging etc.). To make the ingestion procedure as fast as possible we use a mixed Python/C++ environment and run the required tasks in a simple parallel mode of operation, in which the data are split into daily chunks and then processed on different computers. The created data files can be ingested into the database as soon as they are available. This flexible way of handling the data makes the most use of the available CPUs, as the comparison with sequential processing shows.
Source: arXiv, 0711.2042
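The splitting scheme described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`split_into_daily_chunks`, `process_chunk`, `run_pseudo_parallel`), the record layout, and the file names are all hypothetical, and a local thread pool stands in for the separate computers mentioned in the abstract. The sketch only shows the general idea of grouping data by day, processing the groups concurrently, and ingesting each result as soon as it is ready.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor, as_completed


def split_into_daily_chunks(records):
    """Group (date, filename) pairs into one chunk per observing date."""
    chunks = defaultdict(list)
    for date, filename in records:
        chunks[date].append(filename)
    return dict(chunks)


def process_chunk(date, filenames):
    """Hypothetical stand-in for the per-day work that produces an ingest file."""
    return {"date": date, "n_files": len(filenames)}


def run_pseudo_parallel(records, ingest, max_workers=4):
    """Process daily chunks concurrently and ingest each result once it is ready."""
    chunks = split_into_daily_chunks(records)
    # Assumption: a local thread pool replaces the separate machines.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(process_chunk, d, f) for d, f in chunks.items()]
        # as_completed yields each future when it finishes, so ingestion
        # does not have to wait for the slowest chunk.
        for future in as_completed(futures):
            ingest(future.result())


if __name__ == "__main__":
    # Hypothetical input: two nights of data, file names invented for illustration.
    records = [
        ("2007-11-01", "night1_a.fit"),
        ("2007-11-01", "night1_b.fit"),
        ("2007-11-02", "night2_a.fit"),
    ]
    ingested = []
    run_pseudo_parallel(records, ingested.append)
    for result in sorted(ingested, key=lambda r: r["date"]):
        print(result["date"], result["n_files"])
```

Passing the `ingest` step in as a callable keeps the sketch independent of any particular database. In a real setup it would presumably be replaced by a bulk load into the archive.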