Return to ENVRI Community Home![]()
1.Data processing desiderata: input
i. What data are to be processed? What are their:
[can’t answer
ii. How is the data made available to the analytics phase? By file, by web (stream/protocol), etc.
[can’t answer]
iii. Please provide concrete examples of data.
[can’t answer]
2. Data processing desiderata: analytics
i. Computing needs quantification:
[can’t answer]
ii. Process implementation:
○ Programming languages?
○ Platform?
○ Specific software requirements?
[can’t answer]
[Answers here]
[Answers here]
[Answers here]
iii. Do you use batch or interactive processing?
[Answers here]
iv. Do you use a monitoring console?
[Answers here]
v. Do you use or black box or workflow processing?
[Answers here]
vi. Please provide concrete examples of processes to be supported/currently in use;
[Answers here]
3. Data processing desiderata: output
i. What data are produced? Please provide:
[Answers here]
ii. How are analytics outcomes made available?
[Answers here]
4. Statistical questions
i. Is the data collected with a distinct question/hypothesis in mind? Or is simply something being measured?
[Answers here]
ii. Will questions/hypotheses be generated or refined (broadened or narrowed in scope) after the data has been collected? (N.B. Such activity would not be good statistical practice)
[Answers here]
5. Statistical data
i. Does the question involve analysing the responses of a single set of data (univariate) to other predictor variables or are there multiple response data (bi or multivariate data)?
[Answers here]
ii. Is the data continuous or discrete?
[Answers here]
iii. Is the data bounded in some form (i.e. what is the possible range of the data)?
iv. Typically how many datums approximately are there?
[Answers here]
6. Statistical data analysis
i. Is it desired to work within a statistics or data mining paradigm? (N.B. the two can and indeed should overlap!)
[Answers here]
ii. Is it desired that there is some sort of outlier/anomaly assessment?
[Answers here]
iii. Are you interested in a statistical approach which rejects null hypotheses (frequentist) or generates probable belief in a hypothesis (Bayesian approach) or do you have no real preference?
[Answers here]
| Go-between | @Yin Chen |
|---|---|
| RI representative | Jon Borre Orbek, Angelo Viola, Vito Vitale |
| Period of requirements collection | Aug 2015- Jan 2016 |
| Status |