Feature #836

RCP: 0.7.5, CAH, the number of clusters tool bar spinner should adjust its maximum value according to the columns/rows current computing mode

Added by Sebastien Jacquot over 5 years ago. Updated 4 days ago.

Status:Feedback Start date:05/28/2014
Priority:Normal Due date:
Assignee:Sebastien Jacquot % Done:

50%

Category:UI Spent time: -
Target version:TXM 0.8.1

Description

The number of clusters tool bar spinner should adjust its maximum value according to the columns/rows current computing mode. At this moment, there's an R eval error, if the clusters number is higher than the number of entries.
The TBX CAH class should also implements some tests about this.

History

#1 Updated by Sebastien Jacquot over 5 years ago

  • Tracker changed from Bug to Feature
  • Category changed from Stats / R to UI
  • Status changed from New to Feedback
  • % Done changed from 0 to 50

The verification tests have been implemented in the UI.
The maximum number of clusters has been fixed as this : rows count - 1 and columns count -1, according to the computing mode (rows or columns) for CA and Lexical Table because FatoMineR can't compute a CAH where the cluster numbers is equals to the number of rows/cols. Is it normal ?

Other tasks to do:
- add a test, when the CAH command will be available from the Partition nodes, to check if the parts number is greater than 3 (a CAH need at least 4 parts ? Actually I think FactoMine R can compute 3 parts CAH dendrogram but can not generate the clusters, need to recheck this point)
- FactoMineR seems to manage a maximum of 16 clusters when plotting with plot.HCPC, it may be a FactoMineR bug linked to the plotting of inertia barplot
We can see when computing a CAH on rows that the barplot never exceeds 16 bars and trying to plot more than 16 clusters results in an error in R source code. After some tests plot.HCPC with argument tree.barplot=FALSE can exceed 16 clusters. A solution may be to plot the inertia barplot in another SWT component with plot.HCPC, as for the CA singular values.
The results generated by HCPC and used in CAH can contain more clusters than 16 but FactoMineR doesn't seems to manage to draw them with plot.HCPC. At this moment, I fixed the max clusters to 16 in the UI tool bar. We may investigate this and also limit the maximum clusters to 16 in the CAH preference page, if it's confirmed. NOTE: We could generate charts with more clusters directly form the HCPC result by using the JFC charts engine since it doesn't use plot.HCPC. In this case we need to find a way to define the max clusters count outside the CAH class, maybe at the charts engine levels, e.g. ChartsEngine.getMaxAvailableClusters(CAH ca) which could return 16 for the R implementation and the real maximum for the JFC implementation
- define if implement these verifications at CAH TBX level, before/during step computing, is useful

#2 Updated by Sebastien Jacquot about 5 years ago

  • Target version changed from TXM 0.7.7 to TXM 0.7.8

#3 Updated by Matthieu Decorde about 4 years ago

  • Target version changed from TXM 0.7.8 to TXM 0.8.0a (split/restructuration)

#4 Updated by Sebastien Jacquot over 1 year ago

  • Target version changed from TXM 0.8.0a (split/restructuration) to TXM 0.8.0

#5 Updated by Matthieu Decorde 8 months ago

  • Target version changed from TXM 0.8.0 to TXM 0.8.1

#6 Updated by Matthieu Decorde 4 days ago

  • Subject changed from RCP: 0.7.5, the number of clusters tool bar spinner should adjust its maximum value according to the columns/rows current computing mode to RCP: 0.7.5, CAH, the number of clusters tool bar spinner should adjust its maximum value according to the columns/rows current computing mode

Also available in: Atom PDF