# Date Author Comment
42584 19/05/2016 11:20 AM Claudio Atzori

added counters to keep track of the relationships provenance

42534 13/05/2016 10:50 AM Alessia Bardi

new test for claim updates

42509 11/05/2016 06:46 PM Alessia Bardi

updated opentrial sample record

42501 11/05/2016 04:49 PM Claudio Atzori

excluding dateoftransformation from metadata fields, it should be serialised only in the record header

42499 11/05/2016 03:30 PM Alessia Bardi

Added dr:dateOfTransformation to some test XML files.
For publications dr:dateOfCollection must be set.
For datasets dri:dateOfCollections must be set.

42495 10/05/2016 07:17 PM Alessia Bardi

Testing OpenTrials dataset record mapping. Depending on snapshot parent.

42392 02/05/2016 03:21 PM Claudio Atzori

import cleanup

42391 02/05/2016 03:19 PM Claudio Atzori

reverting, we need fewer getters

42384 02/05/2016 02:53 PM Claudio Atzori

tests for dedup experiments

42383 02/05/2016 02:48 PM Claudio Atzori

added more getters

42382 02/05/2016 02:47 PM Claudio Atzori

dedup experiments

42381 02/05/2016 02:47 PM Claudio Atzori

added mapper class for hdfs actions

42362 02/05/2016 12:13 PM Claudio Atzori

cleanup

42247 15/04/2016 06:06 PM Claudio Atzori

added Mapper class PromoteActionSetFromHDFS

41764 18/03/2016 06:13 PM Claudio Atzori

added anchorStats map-only job

41681 15/03/2016 12:48 PM Claudio Atzori

added counter for DOIs

41649 10/03/2016 03:55 PM Claudio Atzori

removing useless counters

41648 10/03/2016 03:54 PM Claudio Atzori

using most recent dnet-pace-core features

41647 10/03/2016 03:54 PM Claudio Atzori

fixed DedupDeleteRelMapper

41646 10/03/2016 03:53 PM Claudio Atzori

do not export deleted entities

41645 10/03/2016 03:52 PM Claudio Atzori

adapted to the removal of contributors as relationships

41517 02/03/2016 06:37 PM Claudio Atzori

added utility methods to deal with strings rather than byte[]

41516 02/03/2016 06:31 PM Claudio Atzori

sort merged ids

41515 02/03/2016 06:30 PM Claudio Atzori

log the documents being compared before failing

41477 29/02/2016 03:44 PM Claudio Atzori

test for ARC

41468 29/02/2016 03:22 PM Claudio Atzori

introducing support for projects that don't provide a link to a specific fundingpath.

41070 27/01/2016 06:59 PM Claudio Atzori

implemented job and workflow to export the openaire identifiers

41055 27/01/2016 12:05 PM Claudio Atzori

log the number of items clustered on each key

41054 27/01/2016 11:59 AM Claudio Atzori

do not consider deleted entities

40341 11/12/2015 10:19 AM Alessia Bardi

New test for openaire2.0_data compliance for datasets

40331 10/12/2015 06:28 PM Claudio Atzori

updating to dnet-openaire-data-protos:3.5.0

40314 09/12/2015 06:13 PM Claudio Atzori

updated to dnet-openaire-data-protos:3.5.0-SNAPSHOT

40205 02/12/2015 06:13 PM Claudio Atzori

cleanup, extended tests to include new relationships and mapping profiles

40129 27/11/2015 04:19 PM Claudio Atzori

counters

40126 27/11/2015 03:30 PM Claudio Atzori

counter test

40063 20/11/2015 05:51 PM Alessia Bardi

Tests load the XSLT from the TDSRule profiles in dnet-openaireplus-profiles

40039 20/11/2015 03:43 PM Alessia Bardi

Back to revision r39888 and updated pom and sh files

39972 17/11/2015 05:40 PM Claudio Atzori

added possibility to post-process the result stored in the index documents

39888 12/11/2015 04:00 PM Alessia Bardi

ticket #1588 Rename "native" compatibility to "proprietary"

39623 19/10/2015 11:35 AM Michele Artini

use of external properties

39616 16/10/2015 05:21 PM Claudio Atzori

added min distance algorithm, used to identify the connected components (dedup)

39605 16/10/2015 04:34 PM Michele Artini

limit the job to institutional pubsrepository

39584 16/10/2015 09:43 AM Michele Artini

counter labels

39567 14/10/2015 11:58 AM Michele Artini

use of Text instead of ImmutableBytesWritable

39562 13/10/2015 04:56 PM Michele Artini

reimplemented calculatePersonDistribution M/R job to consider only the results from pubsrepositories (not journals)

39524 09/10/2015 12:18 PM Claudio Atzori

reuse the same outkey and outvalue objects

39431 01/10/2015 11:07 AM Claudio Atzori

added more mapping tests, using xslt picked from services.openaire

39297 18/09/2015 04:18 PM Claudio Atzori

spring makes me lazy

39290 18/09/2015 03:07 PM Claudio Atzori

added infospace dump mapper

39275 18/09/2015 09:19 AM Claudio Atzori

39222 14/09/2015 06:06 PM Claudio Atzori

added information space export job

38951 02/09/2015 12:58 PM Alessia Bardi

testing umlauts

38950 02/09/2015 12:58 PM Alessia Bardi

testing umlauts

38835 28/08/2015 10:24 AM Claudio Atzori

cleanup

38834 28/08/2015 10:24 AM Claudio Atzori

updated to the new mongodb driver specs

38692 21/08/2015 12:42 PM Alessia Bardi

Null values for FP7 and H2020 specific fields about OA mandate and Data Pilot.

38671 05/08/2015 06:13 PM Alessia Bardi

Do not check the status of a record: we assume we have to insert it because the OAI store is built in refresh mode.

38665 04/08/2015 05:33 PM Alessia Bardi

OAIStore with compressed bodies. Currently for beta only.

38586 29/07/2015 05:23 PM Claudio Atzori

fixed tests, added new dedup specific jobs

38374 20/07/2015 04:59 PM Claudio Atzori

added implementors for offline dedup person workflow

38324 17/07/2015 03:02 PM Claudio Atzori

cleanup

38322 17/07/2015 11:54 AM Claudio Atzori

cleanup