Comments (2)
Is this because multiple copies show up during the AD migration phase? (cache to permanent storage)
from storage-inventory.
First thing to fix:
The Iterator must return items in StorageMetadata order (can't believe I missed this earlier)... so the query needs an ORDER BY. I would suggest logging into sybase and comparing the query plans for "ORDER BY uri" and "ORDER BY archive, filename" (column names?) to see whether they differ and which is better. The missing ordering would not affect an initial bulk load, but it will cause a subsequent run of tantar to fail horribly because tantar does a merge join of two iterators and relies on ordering.
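To illustrate why ordering matters, here is a minimal sketch of a merge join over two iterators. It is not tantar's actual code: the class name `MergeJoinSketch`, the method `mergeJoin`, and the use of plain strings in place of StorageMetadata are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

// Hypothetical illustration only; strings stand in for StorageMetadata.
public class MergeJoinSketch {

    // Walks both iterators in lock step, assuming each is sorted ascending.
    // Reports which side(s) each value was seen on.
    static List<String> mergeJoin(Iterator<String> left, Iterator<String> right) {
        List<String> out = new ArrayList<>();
        String l = left.hasNext() ? left.next() : null;
        String r = right.hasNext() ? right.next() : null;
        while (l != null || r != null) {
            int cmp;
            if (l == null) {
                cmp = 1;            // left exhausted: remaining values are right-only
            } else if (r == null) {
                cmp = -1;           // right exhausted: remaining values are left-only
            } else {
                cmp = l.compareTo(r);
            }
            if (cmp == 0) {
                out.add("both:" + l);
                l = left.hasNext() ? left.next() : null;
                r = right.hasNext() ? right.next() : null;
            } else if (cmp < 0) {
                out.add("left-only:" + l);
                l = left.hasNext() ? left.next() : null;
            } else {
                out.add("right-only:" + r);
                r = right.hasNext() ? right.next() : null;
            }
        }
        return out;
    }
}
```

With sorted input, `a, b, c` against `a, c` reports only `b` as missing on the right. If the left side arrives as `c, a, b` against a sorted `a, b, c`, the same code reports `a` and `b` as mismatches on both sides even though the two sets are identical, which is the kind of failure an unordered query would cause.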
Two possible fixes for the duplicate issue:
- since there is an ORDER BY (!) it might be OK to do SELECT DISTINCT: check the query plan with and without it to see if it changes in a meaningful way.
- since there is an ORDER BY, a duplicate StorageMetadata will come out of next() immediately after the identical one, so the iterator could track the previously returned value and skip duplicates... I'd probably do this because sending extra rows from sybase to the java code is cheap and it has to work.
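A rough sketch of the second option, assuming the wrapped iterator is already sorted and that the element type has a meaningful `equals()`. The class name `SkipAdjacentDuplicates` is hypothetical and not part of the existing code; it is generic here so it can be tried with strings before being applied to StorageMetadata.

```java
import java.util.Iterator;
import java.util.NoSuchElementException;
import java.util.Objects;

// Hypothetical wrapper: drops any element equal to the one returned just before it.
// Only adjacent duplicates are removed, so the source must be ordered.
public class SkipAdjacentDuplicates<T> implements Iterator<T> {
    private final Iterator<T> source;
    private T previous;                  // last value handed to the caller
    private boolean returnedAny = false; // distinguishes "no previous" from a null previous
    private T pending;                   // next value to return, if hasPending
    private boolean hasPending = false;

    public SkipAdjacentDuplicates(Iterator<T> source) {
        this.source = source;
    }

    // Read ahead until a value that differs from the previous one is found.
    private void advance() {
        while (!hasPending && source.hasNext()) {
            T candidate = source.next();
            if (!returnedAny || !Objects.equals(candidate, previous)) {
                pending = candidate;
                hasPending = true;
            }
        }
    }

    @Override
    public boolean hasNext() {
        advance();
        return hasPending;
    }

    @Override
    public T next() {
        advance();
        if (!hasPending) {
            throw new NoSuchElementException();
        }
        T ret = pending;
        previous = ret;
        returnedAny = true;
        pending = null;
        hasPending = false;
        return ret;
    }
}
```

Because it only compares against the previously returned value, the wrapper keeps constant memory no matter how many rows come back, but it silently lets duplicates through if the ORDER BY is missing.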
Related Issues (20)
- minoc: make service work if renamed at deploy time
- raven: make service work if renamed at deploy time HOT 1
- luskan: self-lookup is an anti-pattern HOT 1
- raven: add support for files API params
- tantar: dryrun mode doesn't produce any output HOT 1
- raven: does not work without grant providers HOT 2
- model change: StorageSite
- critwall: use Protocol.securityMethod to decide how to authenticate download calls
- cadc-storage-adapter-swift should support a pool of clients HOT 1
- raven: should not hand out anon URLs for resources that are not accessible anonymously
- baldur: should verify the namespace syntax when reading baldur.properties
- baldur: permission rules should match raven style HOT 1
- cadc-storage-adapter-fs: remove hard-coded assumption of md5 file checksum
- cadc-storage-adapter-swift: remove hard-coded assumption of md5 file checksum
- cadc-storage-adapter: add stale transaction cleanup API HOT 1
- logControl permissions hard-coded: luskan, raven, minoc HOT 1
- NPE in swift adapter when reverting a file segment
- `minoc` metadata updates are not propagated to all storage locations HOT 1
- minoc: check If-Modified-Since request header
- cadc-storage-adapter-fs clarification HOT 3