1

Re: Observations

Hi all,

Has anyone come across this before...

Having just imported a dataset (170,000+ moth records) I'm checking through the Observations window and I'm finding a large number of circumstances whereby samples made on the same date at the same location, by the same recorder, with the same sample/record type aren't being grouped together as a single observation.

So, instead of having, say, a single observation (same date/same location) with a list of x species for the same sample/record type as I'd expected, I'm getting the same x number of observations (same date/same location, same sample/record type) with only one species recorded under that observation. I've looked at my import data and I can see no reason why Recorder isn't grouping them under a single observation.

Any comments?

Best wishes,
Les.

Les Evans-Hill
Senior Data Officer
Butterfly Conservation, Butterflies for the New Millennium and National Moth Recording Scheme

2

Re: Observations

I'm having exactly the same problem also with a large dataset (imported from an Access table). This is a fault that was supposed to have been fixed in 6.10, but it is definitely still occurring for me in this case. Thing is, I've imported a selection of the same data from an Excel table and it is grouping the records correctly. I'm just now trying to analyse whether it's the data that's causing the problem or Recorder itself.

I'll keep you posted.

Charles Roper
Digital Development Manager | Field Studies Council
http://www.field-studies-council.org | https://twitter.com/charlesroper | https://twitter.com/fsc_digital

3

Re: Observations

Hi Charles,

I've just done the same and I concur your findings i.e. take a section of data that didn't group and re-imported it to a 'test' database; this data grouped fine.

I can't see anything (visible) in my Excel CSV data that the import process would regard it as requiring splitting into separate observations.

Best wishes,
Les.

Les Evans-Hill
Senior Data Officer
Butterfly Conservation, Butterflies for the New Millennium and National Moth Recording Scheme

4

Re: Observations

Hi

Charles - please let me know if you can't spot anything in Access file that would cause this and I'll raise with Dorset Software.

Many thanks,

Sarah Shaw
Biodiversity Information Assistant
JNCC

5

Re: Observations

Just to keep everyone in the loop, I've sent some sample data to Sarah. I've not been able to identify anything in the data itself that would be causing the problem, so it looks like it might be a fault in the IW.

Charles Roper
Digital Development Manager | Field Studies Council
http://www.field-studies-council.org | https://twitter.com/charlesroper | https://twitter.com/fsc_digital

6

Re: Observations

Just a quick update on this. I've now managed to get the data to import correctly after many, many, many tries. The solution was to split out the moth data I was importing into a separate import.

I eventually noticed that moths seemed to be the problem because they weren't collapsing, but other taxa were.

By importing the moths and the other records separately, everything collapsed appropriately under relevant samples. I've no idea why the mixed import is breaking, but it seems related to large numbers of moths. Perhaps because so many moth occurrences appear under one sample, as is often the case with light traps. I don't know, but there's obviously some obscure bug at work here.

Charles Roper
Digital Development Manager | Field Studies Council
http://www.field-studies-council.org | https://twitter.com/charlesroper | https://twitter.com/fsc_digital

7

Re: Observations

Charles,

Thanks for your work on this...

I still have the problem; I've just imported 50,000 moth records (in 2 hours 45 minutes) via an ADO Connection with the same result. If this is a 'bug', is it safe to assume the imported data has imported correctly into SQL Server i.e. the right tables, it's just the Recorder front-end that's making a meal of the observations by not creating the appropriate sample hierarchy?

I'm assuming if/when this 'bug' gets fixed, my Samples hierarchy in Observations will sort itself out and I won't have to re-import everything I've done so far i.e. start again from scratch. I've just received 870,000+ moth records from Hampshire, that's 47.85 hours to import the lot based on the above figures, without going through hoops to get it into the correct format (MapMate -> Recorder) - I really don't want to have to re-import this lot if I don't have to.

Sarah/Dorset Software,

Do you have any further thoughts re this issue?

Best wishes,
Les.

Les Evans-Hill
Senior Data Officer
Butterfly Conservation, Butterflies for the New Millennium and National Moth Recording Scheme

8

Re: Observations

Hi Les

Sorry to hear that you are having problems with this too.

Originally Charles sent me the import file but I was unable to reproduce the problem here - even with the large import the survey events/samples were created correctly. Myself and Charles have tried different variations - withholding columns and matching locations differently etc. but none of this seems to make a difference - Charles was still getting the wrong results and my import seemed ok.

Charles has now provided me with a full copy of his database and I too can reproduce the problem with this database.

So it appears the problem is database specific, but not specific to Charles' database!

The next step is to ask Dorset Software to take a look at problem, or perhaps suggest some checks that I can make here.

I'll keep you both updated,

Best wishes,

Sarah Shaw
Biodiversity Information Assistant
JNCC

9

Re: Observations

Hi Sarah,

If there is anything I can do to help, let me know.

Best wishes,
Les.

Les Evans-Hill
Senior Data Officer
Butterfly Conservation, Butterflies for the New Millennium and National Moth Recording Scheme

10

Re: Observations

Hi Les,

Unfortunately this isn't just a display thing otherwise it wouldn't be such a problem. New entries are actually being created in the survey_event and sample tables for every single record. I've sent Sarah some test data so hopefully the problem will be fixed in the next version. Unfortunately that won't help with anything you've imported already.

In theory, though, it shouldn't be too hard to create a routine that fixes things. Mike Weideli is coming down here next week and so I'll speak to him to see how easy it would be to create a temporary post-import patch script.

Charles Roper
Digital Development Manager | Field Studies Council
http://www.field-studies-council.org | https://twitter.com/charlesroper | https://twitter.com/fsc_digital

11

Re: Observations

Hi Charles,

A shame, I was hoping this was a display issue and not a direct problem with the way the data is written to the database during the import process. I'll stop importing 'live' data for the time being until I/we know for sure the issue can either be worked-around or a fix is imminent.

Until then, I'll do some further investigative work here.

Best wishes,
Les.

Les Evans-Hill
Senior Data Officer
Butterfly Conservation, Butterflies for the New Millennium and National Moth Recording Scheme

12

Re: Observations

Hi all,

Regretfully this issue I posted back in 2007 may still be a problem. Having just ran a couple of summary reports for my CEO (Events per Survey and Occurrences per Survey) a query was raised with respect to the number of occurrences per event which was very low at 4.

I checked some of the Surveys and unfortunately I found the same problem I originally reported with the correct grouping of events and occurrences. Can anyone else concur this with other datasets?

Les Evans-Hill
Senior Data Officer
Butterfly Conservation, Butterflies for the New Millennium and National Moth Recording Scheme

13

Re: Observations

Further to my post yesterday...

I've just ran a quick import test with 20 records for the same Event/Sample but split it into 2 imports. The result was Recorder created 2 identical Event/Samples, splitting the Occurrences between the two.

The workaround here is if you have to split your import into multiple imports, ensure your Events/Samples are in the same import segment. The sort order is important in your spreadsheet i.e. Date/Location/Taxon to ensure the correct grouping of Events and Samples during the import process - this may be good practice even if you don't split your import data?

I hope this makes some sense!

Les Evans-Hill
Senior Data Officer
Butterfly Conservation, Butterflies for the New Millennium and National Moth Recording Scheme

14

Re: Observations

Hi Les,
I think this was a conscious decision of the import tool, because to correctly detect the existing samples for any new data would mean running lots of queries against the main database during import and hence would slow down the import massively.
Best Wishes

John van Breda
Biodiverse IT

15

Re: Observations

Hi John,

I suspected just that scenario - thanks for confirming.

Les Evans-Hill
Senior Data Officer
Butterfly Conservation, Butterflies for the New Millennium and National Moth Recording Scheme